Description

Technical Field 

[0001] The invention relates to materials and methodologies for managing the spread of non-A, non-B hepatitis virus
(NANBV) infection. More specifically, it relates to an etiologic agent of non-A, non-B hepatitis (NANBH), hepatitis C 
virus (HCV), and to polynucleotides and analogs thereof, which are useful in assays for the detection of HCV in biological 
samples. 

References Cited in the Application
[0002] 

Barr et al. (1986), Biotechniques 4:428. 
Beaucage et al. (1981), Tetrahedron Letters 22:1859.

Botstein (1979), Gene 8:17. 

Brinton, MA (1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat 
and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum Press), p. 327-374. 

Broach (1981) in: Molecular Biology of the Yeast Saccharomyces, Vol. 1, p.445, Cold Spring Harbor Press. 
Broach et al. (1983), Meth. Enz. 101:307.

Brown et al. (1979), Methods in Enzymology 68:109. 

Byrne et al. (1988), Nucleic Acids Res. 16:4165. 

Castle et al. (1986), Virology 119:10. 

Chang et al. (1977), Nature 198:1056. 
Chirgwin et al. (1979), Biochemistry 18:5294.

Choo et al. (1989), Science 244:359. 

Chomczynski and Sacchi (1987), Analytical Biochemistry 162:156. 

Clewell et al. (1969), Proc. Natl. Acad. Sci. USA 62:1159. 

Clewell (1972), J. Bacteriol. 110:667.
Cohen (1972), Proc. Natl. Acad. Sci. USA 69:2110.

Cousens et al. (1987), Gene 61:265. 

De Boer et al. (1983), Proc. Natl. Acad. Sci. USA 292:128. 

Dreesman et al. (1985), J. Infect. Disease 151:761. 

Feinstone et al. (1981), J. Inf. Dis. 144: 588. 
Feinstone et al. (1983), Infection and Immunology 41:816.

Feinstone, S.M. and Hoofnagle, J.H. (1984), New Engl. J. Med. 311:185. 

Fields & Knipe (1986), FUNDAMENTAL VIROLOGY (Raven Press, N.Y.). 

Fiers et al. (1978), Nature 273:113. 

Gerety, R.J. et al., in VIRAL HEPATITIS AND LIVER DISEASE (Vyas, B.N., Dienstag, J.L., and Hoofnagle, J.H.,
eds, Grune and Stratton, Inc., 1984) pp 23-47.

Goeddel et al. (1980), Nucleic Acids Res. 8:4057. 

Graham and Van der Eb (1978), Virology 52:546. 

Grunstein and Hogness (1975), Proc. Natl. Acad. Sci. USA 73:3961. 

Grych et al. (1985), Nature 316:74. 
Gubler and Hoffman (1983), Gene 25:263.

Hahn et al. (1988), Virology 162:167. 

Han (1987), Biochemistry 26:1617. 

Hammerling et al. (1981), MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS. 
Hess et al. (1968), J. Adv. Enzyme Reg. 7:149.

Hinnen et al. (1978), Proc. Natl. Acad. Sci. USA 75:1929.

Hitzeman et al. (1980), J. Biol. Chem. 255:2073. 

Holland et al. (1978), Biochemistry 17:4900.

Holland (1981), J. Biol. Chem. 256:1385.

Houghton et al. (1981), Nucleic Acids Res. 9:247.
Huynh, T.V. et al. (1985) in DNA CLONING TECHNIQUES; A PRACTICAL APPROACH (D. Glover, Ed., IRL Press,
Oxford, U.K.) pp. 49-78.

Immun. Rev. (1982) 62:185. 

Iwarson (1987), British Medical J. 295:946. 




EP 0 398 748 B1 


Kennett et al. (1980) MONOCLONAL ANTIBODIES. 
Kuo et al. (1989), Science 244:362. 
Kyte and Doolittle (1982), J. Mol. Biol. 157:105-132. 
Landegren et al. (1988), Science 242:229.
Maniatis, T., et al. (1982) MOLECULAR CLONING; A LABORATORY MANUAL (Cold Spring Harbor Press, Cold
Spring Harbor, N.Y.).

Matthews and Kricka (1988), Analytical Biochemistry 169:1. METHODS IN ENZYMOLOGY (Academic Press). 
Mittlin (1989), Clinical chem. 35:1819. 
Laemmli (1970), Nature 227, 680. 
Lee et al. (1988), Science 239:1288.

Loh et al. (1989), Science 243:217. 
Mackow et al. (1987), Virology 159:217.

Mayer and Walker, eds. (1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic
Press, London).
Maxam et al. (1980), Methods in Enzymology 65:499.

MacNamara et al. (1984), Science 226:1325.
Messing et al. (1981), Nucleic Acids Res. 9:309. 

Messing (1983), Methods in Enzymology 101:20-37. METHODS IN ENZYMOLOGY (Academic Press). 

Michelle et al., Int. Symposium on Viral Hepatitis. 
Monath (1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and
Wagner, vol. eds. Schlesinger and Schlesinger, Plenum Press), p. 375-440.

Murakawa et al. (1988), DNA 7:287. 

Nagahuma et al. (1984), Anal. Biochem. 141:74. 

Narang et al (1979), Methods in Enzymology 68:90. 
Neurath et al. (1984), Science 224:392.

Nisonoff et al. (1981), Clin. Immunol. Immunopathol. 21:397-406. 

Overby, L.R. (1985), Curr. Hepatol. 5:49.

Overby, L.R. (1986), Curr. Hepatol. 6:65. 

Overby, L.R. (1987), Curr. Hepatol. 7:35. 
Peleg (1969), Nature 221:193.

Pfefferkorn and Shapiro (1974), in COMPREHENSIVE VIROLOGY, Vol. 2 (Fraenkel-Conrat & Wagner, eds.,
Plenum, N.Y.) pp. 171-230.

Prince, A.M. (1983), Annu. Rev. Microbiol. 37:217. 
Rice et al. (1985), Science 229:726. 
Rice et al. (1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and
Wagner, vol. eds. Schlesinger and Schlesinger, Plenum Press), p. 279-328.

Roehrig (1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and
Wagner, vol. eds. Schlesinger and Schlesinger, Plenum Press).

Rosenberg et al. (1984), Nature 312:7. 
Sadler et al. (1980), Gene 8:279.

Saiki et al. (1985), Science 230:1350. 

Saiki et al. (1986), Nature 324:163.

Saiki et al. (1988), Science 239:487. 

Sanger et al. (1977), Proc. Natl. Acad. Sci. USA 74:5463. 
Scharf et al. (1986), Science 233:1076.

Schlesinger et al. (1986), J. Virol. 60:1153. 

Schreier, M., et al. (1980) HYBRIDOMA TECHNIQUES.

Scopes (1984), PROTEIN PURIFICATION, PRINCIPLES AND PRACTICE, SECOND EDITION (Springer-Verlag, 
N.Y). 

Shimatake et al. (1981), Nature 292:128.

Shigekawa and Dower (1988), BioTechniques 6:742. 
Steimer et al. (1986), J. Virol. 58:9. 

Stollar (1980), in THE TOGAVIRUSES (R.W. Schlesinger, ed., Academic Press, N.Y), pp. 584-622. 
Sumiyoshi et al. (1987), Virology 161:497. 
Taylor et al. (1976), Biochem. Biophys. Acta 442:324.

Towbin et al. (1979), Proc. Natl. Acad. Sci. USA 76, 4350. 

Tsu and Herzenberg (1980), in SELECTED METHODS IN CELLULAR IMMUNOLOGY (W.H. Freeman and Co.) 
pp. 373-391. 




Uytdehaag et al. (1985), J. Immunol. 134:1225.
Valenzuela, P., et al. (1982), Nature 298:344. 

Valenzuela, P., et al. (1984), in HEPATITIS B (Millman, I., et al., ed, Plenum Press) pp. 225-236. 
Warner (1984), DNA 3:401. 

Wu and Grossman (1987), Methods in Enzymology Vol. 154, RECOMBINANT DNA, Part E. 
Wu (1987), Methods in Enzymology vol 155, RECOMBINANT DNA, part F. 
Zoller (1982), Nucleic Acids Res. 10:6487. 


Cited Patents

[0003]

U.S. Patent No. 4,341,761
U.S. Patent No. 4,399,121
U.S. Patent No. 4,427,783
U.S. Patent No. 4,444,887
U.S. Patent No. 4,466,917
U.S. Patent No. 4,472,500
U.S. Patent No. 4,491,632
U.S. Patent No. 4,493,890
U.S. Patent No. 4,683,202
U.S. Patent No. 4,458,066
U.S. Patent No. 4,868,105

Background Art

[0004] Non-A, Non-B hepatitis (NANBH) is a transmissible disease or family of diseases that are believed to be
viral-induced, and that are distinguishable from other forms of viral-associated liver diseases, including that caused by the
known hepatitis viruses, i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), and delta hepatitis virus (HDV), as well
as the hepatitis induced by cytomegalovirus (CMV) or Epstein-Barr virus (EBV). NANBH was first identified in transfused
individuals. Transmission from man to chimpanzee and serial passage in chimpanzees provided evidence that NANBH
is due to a transmissible infectious agent or agents.

[0005] Epidemiologic evidence suggests that there may be three types of NANBH: the water-borne epidemic
type; the blood or needle associated type; and the sporadically occurring (community acquired) type. However, the
number of agents that may be causative of NANBH is unknown.

[0006] There have been a number of candidate NANBVs. See, for example, the reviews by Prince (1983), Feinstone
and Hoofnagle (1984), and Overby (1985, 1986, 1987) and the article by Iwarson (1987). However, there is no proof
that any of these candidates represents the etiological agent of NANBH.

[0007] The demand for sensitive, specific methods for screening and identifying carriers of NANBV and
NANBV-contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs in approximately 10% of
transfused patients, and NANBH accounts for up to 90% of these cases. The major problem in this disease is the
frequent progression to chronic liver damage (25-55%).

[0008] Patient care as well as the prevention of transmission of NANBH by blood and blood products or by close
personal contact require reliable screening, diagnostic and prognostic tools to detect nucleic acids, antigens and
antibodies related to NANBV.

[0009] Methods for detecting specific polynucleotides by hybridization assays are known in the art. See, for example,
Matthews and Kricka (1988), Analytical Biochemistry 169:1; Landegren et al. (1988), Science 242:229; and Mittlin
(1989), Clinical chem. 35:1819. See also U.S. Patent No. 4,868,105, issued Sept. 9, 1989, and E.P.O. Publication
No. 225,807 (published June 16, 1987).

[0010] Applicant discovered a new virus, the Hepatitis C Virus (HCV), which has proven to be the major etiologic
agent of blood-borne NANBH (BB-NANBH). Applicant's initial work, including a partial genomic sequence of the
prototype HCV isolate, CDC/HCV1 (also called HCV1), is described in E.P.O. Publication No. 318,216 (published 31 May
1989) and PCT Pub. No. WO 89/04669 (published 1 June 1989). The disclosures of these patent applications, as well
as any corresponding national patent applications, are incorporated herein by reference. These applications teach,
inter alia, recombinant DNA methods of cloning HCV sequences, HCV probe diagnostic techniques, anti-HCV
antibodies, and methods of isolating new HCV sequences.
Disclosure of the Invention


[0011] The present invention is based on HCV sequences described in E.P.O. Publication No. 318,216 and in PCT
Pub. No. WO 89/04669, as well as other HCV sequences that are described herein. Methods for isolating and/or
detecting specific polynucleotides by hybridization could not be used for screening for HCV until Applicants' discovery
of HCV. Accordingly, one aspect of the invention is a method for the detection of variant hepatitis C virus (HCV) strains
in a sample, which method comprises:


a) subjecting the sample to a polymerase chain reaction (PCR), using primers of sufficient length to prime the
synthesis of extension products in the presence of an agent for polymerization, wherein at least one primer is
capable of selectively hybridizing to a conserved region of an HCV genome as shown in Figure 18 or complement
thereof; and

b) analysing the size or sequence of any amplified product. 
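
Steps (a) and (b) can be illustrated in silico. The following is a minimal sketch under stated assumptions, not the assay itself: it assumes exact primer matches and a cDNA template supplied as a string. The function name `predict_amplicon` and the toy template are hypothetical and are not HCV sequences. It locates the forward primer on the sense strand and the binding site of the reverse primer, then returns the product whose size step (b) would analyse.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def predict_amplicon(template: str, forward: str, reverse: str):
    """Return the predicted PCR product, or None if either primer fails to bind.

    Assumption: exact matching only. `forward` must occur in the template
    (sense strand); `reverse` must occur in its reverse complement.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the sense strand, so the sense strand
    # carries the reverse complement of the reverse primer.
    rev_site = reverse_complement(reverse)
    end = template.find(rev_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


# Toy template and primers (hypothetical, for illustration only).
toy = "GGATCCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACACCTTGGCATGC"
product = predict_amplicon(toy, "ATGAGCACGAAT", "CAAGGTGTTACG")
size = len(product)  # the quantity compared across isolates in step (b)
```

A variant strain that retains both conserved primer sites but carries an insertion or deletion between them would yield a product of a different size, and one with point substitutions would differ only in sequence.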

[0012] Another aspect of the invention is a polynucleotide primer selected from:

5'CTC TAT GGC AAT GAG G3';
5'CGT TGG CAT AAC TGA T3';
5'CGA CAA GAA AGA CAG A3';
5'AGC TTC GAC GTC ACA T3';
5'TGA ACT ATG CAA CAG G3';
5'GGA GTG TGC AGG ATG G3';
5'AAG GTT GCA ATT GCT C3'; and
5'ACT AAC AGG ACC TTC G3'.
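
As a quick consistency check on the primers listed above, the sketch below tabulates length, GC fraction, and a rule-of-thumb melting temperature. The sequences are transcribed from the list with the end marks dropped, and damaged end marks are assumed to read 3'. The 2(A+T) + 4(G+C) rule is a common approximation for short oligomers and is an assumption here; the source does not state how annealing temperatures were chosen.

```python
# Primer sequences transcribed from the list above (5' to 3').
PRIMERS = [
    "CTCTATGGCAATGAGG",
    "CGTTGGCATAACTGAT",
    "CGACAAGAAAGACAGA",
    "AGCTTCGACGTCACAT",
    "TGAACTATGCAACAGG",
    "GGAGTGTGCAGGATGG",
    "AAGGTTGCAATTGCTC",
    "ACTAACAGGACCTTCG",
]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def wallace_tm(seq: str) -> int:
    """Approximate melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    gc = sum(base in "GC" for base in seq)
    at = sum(base in "AT" for base in seq)
    return 2 * at + 4 * gc


# One row per primer: (sequence, length, GC fraction, approximate Tm).
summary = [(p, len(p), gc_fraction(p), wallace_tm(p)) for p in PRIMERS]
for row in summary:
    print(row)
```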

[0013] Fig. 1 shows the sequence of the HCV cDNA in clone 12f, and the amino acids encoded therein.
[0014] Fig. 2 shows the HCV cDNA sequence in clone k9-1, and the amino acids encoded therein.
[0015] Fig. 3 shows the sequence of clone 15e, and the amino acids encoded therein. 

[0016] Fig. 4 shows the nucleotide sequence of HCV cDNA in clone 13i, the amino acids encoded therein, and the
sequences which overlap with clone 12f.

[0017] Fig. 5 shows the nucleotide sequence of HCV cDNA in clone 26j, the amino acids encoded therein, and the 
sequences which overlap clone 13i. 

[0018] Fig. 6 shows the nucleotide sequence of HCV cDNA in clone CA59a, the amino acids encoded therein, and 
the sequences which overlap with clones 26j and K9-1. 
[0019] Fig. 7 shows the nucleotide sequence of HCV cDNA in clone CA84a, the amino acids encoded therein, and
the sequences which overlap with clone CA59a. 

[0020] Fig. 8 shows the nucleotide sequence of HCV cDNA in clone CA156e, the amino acids encoded therein, and
the sequences which overlap with CA84a. 

[0021] Fig. 9 shows the nucleotide sequence of HCV cDNA in clone CA167b, the amino acids encoded therein, and
the sequences which overlap CA156e.

[0022] Fig. 10 shows the nucleotide sequence of HCV cDNA in clone CA216a, the amino acids encoded therein, 
and the overlap with clone CA167b. 

[0023] Fig. 11 shows the nucleotide sequence of HCV cDNA in clone CA290a, the amino acids encoded therein, 
and the overlap with clone CA216a. 
[0024] Fig. 12 shows the nucleotide sequence of HCV cDNA in clone ag30a and the overlap with clone CA290a.
[0025] Fig. 13 shows the nucleotide sequence of HCV cDNA in clone CA205a, and the overlap with the HCV cDNA 
sequence in clone CA290a. 

[0026] Fig. 14 shows the nucleotide sequence of HCV cDNA in clone 18g, and the overlap with the HCV cDNA
sequence in clone ag30a.

[0027] Fig. 15 shows the nucleotide sequence of HCV cDNA in clone 16jh, the amino acids encoded therein, and 
the overlap of nucleotides with the HCV cDNA sequence in clone 15e. 

[0028] Fig. 16 shows the nucleotide sequence of HCV cDNA in clone 6k, the amino acids encoded therein, and the
overlap of nucleotides with the HCV cDNA sequence in clone 16jh.

[0029] Fig. 17 shows the nucleotide sequence of HCV cDNA in clone p131jh, the amino acids encoded therein, and
the overlap of nucleotides with the HCV cDNA sequence in clone 6k. 

[0030] Fig. 18 shows the compiled HCV cDNA sequence derived from the clones described herein and from the
compiled HCV cDNA sequence presented in E.P.O. Publication No. 318,216. The clones from which the sequence
was derived are 5'-clone32, b114a, 18g, ag30a, CA205a, CA290a, CA216a, pi14a, CA167b, CA156e, CA84a, CA59a,
K9-1 (also called k9-1), 26j, 13i, 12f, 14i, 11b, 7f, 7e, 8h, 33c, 40b, 37b, 35, 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, 39c,
35f, 19g, 26g, 15e, b5a, 16jh, 6k, and p131jh. In the figure the three horizontal dashes above the sequence indicate
the position of the putative initiator methionine codon. Also shown in the figure is the amino acid sequence of the
putative polyprotein encoded in the HCV cDNA. Heterogeneities in cloned DNAs of HCV1 are indicated by the amino
acids indicated above the putatively encoded sequence of the large ORF; the parentheses indicate that the
heterogeneity was detected at or near to the 5'- or 3'- end of the HCV cDNA in the clone.

[0031] Fig. 20 shows a schematic alignment of a flaviviral polyprotein and a putative HCV polyprotein encoded in the
major ORF of the HCV genome. Also indicated in the figure are the possible functions of the flaviviral polypeptides
cleaved from the flaviviral polyprotein. In addition, the relative placements of the HCV polypeptides, NANB5-1-1 and C100,
with respect to the putative HCV polyprotein are indicated.

[0032] Fig. 22 shows the double-stranded nucleotide sequence of the HCV cDNA insert in clone 81, and the putative
amino acid sequence of the polypeptide encoded therein. 

[0033] Fig. 23 shows the HCV cDNA sequence in clone 36, the segment which overlaps the NANBV cDNA of clone 
81, and the polypeptide sequence encoded within clone 36. 
[0034] Fig. 24 shows the HCV cDNA sequence in clone 37b, the segment which overlaps clone 35, and the
polypeptide encoded therein.

[0035] Fig. 25 shows autoradiographs of the HCV cPCR assay on RNA derived from liver samples of chimpanzees 
with NANBH (Fig. 25A) and on Italian patients with NANBH (Fig. 25B). 

[0036] Fig. 26A and 26B are graphs showing the temporal relationship between the display of liver damage, the
presence of HCV RNA, and the presence of anti-HCV antibodies for two chimpanzees with NANBH.

[0037] Fig. 27 shows the nucleotide sequence of HCV cDNA in clone CA84a, the amino acids encoded therein, and 
the sequences which overlap with clone CA59a. 

[0038] Fig. 28 shows the HCV cDNA sequence in clone 40b, the segment which overlaps clone 37b, and the
polypeptide encoded therein.

[0039] Fig. 29 is an autoradiograph showing the labeled amplified products of approximately 300, 30, and 3 CID of
HCV genomes.

[0040] Fig. 32 shows the nucleotide sequence of HCV cDNA in clone 40a. 

[0041] Fig. 33 is an autoradiograph showing amplified products extended from primers derived from conserved
regions of the HCV genome.

[0042] Fig. 34 shows the HCV cDNA sequence in clone 35, the segment which overlaps clone 36, and the polypeptide
encoded therein.

[0043] Fig. 37 is a diagram showing the relationship of probes and primers derived from the 5'-region of HCV RNA, 
from which the HCV cDNAs in clones ag30a and k9-1 are derived. 

[0044] Fig. 38 is an autoradiograph of amplified products extended from sets of primers derived from ag30a and k9-1.

[0045] Fig. 39 shows the aligned nucleotide sequences of human isolates 23 and 27 and of HCV1. Homologous
sequences are indicated by the symbol (*). Non-homologous sequences are in small letters.
[0046] Fig. 40 shows the aligned amino acid sequences of human isolates 23 and 27 and of HCV1. Homologous
sequences are indicated by the symbol (*). Non-homologous sequences are in small letters.
[0047] Fig. 41 shows a half-tone reproduction of an autoradiograph of a Northern blot of RNA isolated from the liver
of a BB-NANBV infected chimpanzee, probed with BB-NANBV cDNA of clone 81.

[0048] Fig. 43 shows a half-tone reproduction of an autoradiograph of nucleic acids extracted from NANBV particles
captured from infected plasma with anti-NANB5-1-1, and probed with 32P-labeled NANBV cDNA from clone 81.
[0049] Fig. 44 shows reproductions of autoradiographs of filters containing isolated NANBV nucleic acids, probed
with 32P-labeled plus and minus strand DNA probes derived from NANBV cDNA in clone 81.

[0050] Fig. 46 shows the nucleotide consensus sequence of human isolate 23; variant sequences are shown below
the sequence line. The amino acids encoded in the consensus sequence are also shown.

[0051] Fig. 47 shows the nucleotide consensus sequence of human isolate 27; variant sequences are shown below
the sequence line. The amino acids encoded in the consensus sequence are also shown.
[0052] Fig. 48 is a graph showing the relationship of the EnvL and EnvR primers to the model flavivirus polyprotein
and putative HCV polyprotein. 

[0053] Fig. 49 shows a comparison of the composite aligned nucleotide sequences of isolates Thorne, EC1, HCT
#18, and HCV1.

[0054] Fig. 50 shows a comparison of the nucleotide sequences of EC10 and a composite of the HCV1 sequence;
the EC10 sequence is on the line above the dots, and the HCV1 sequence is on the line below the dots.

[0055] Fig. 51 shows a comparison of the amino acid sequences 117-308 (relative to HCV1) encoded in the "EnvL"
regions of the consensus sequences of human isolates HCT #18, JH23, JH27, Thorne, EC1, and of HCV1.

[0056] Fig. 52 shows a comparison of the amino acid sequences 330-360 (relative to HCV1) encoded in the "EnvR"
regions of the consensus sequences of human isolates HCT #18, JH23, JH27, Thorne, EC1, and of HCV1.

[0057] Fig. 53 shows the nucleotide sequences of individual primers in primer mixture 5-3. 

Modes for Carrying Out the Invention 

[0058] The term "hepatitis C virus" (HCV) has been reserved by workers in the field for a heretofore unknown
etiologic agent of NANBH. The prototype isolate of HCV has been identified in U.S.S.N. 122,714 (see also E.P.O.
Publication No. 318,216). The term HCV also includes new isolates of the same viral species. As an extension of this
terminology, the disease caused by HCV, formerly called blood-borne NANB hepatitis (BB-NANBH), is called hepatitis
C. The terms NANBH and hepatitis C may be used interchangeably herein.

[0059] HCV is a viral species of which pathogenic strains cause BB-NANBH. There may also be attenuated strains
or defective interfering particles derived therefrom. As shown infra, the HCV genome is comprised of RNA. It is known
that RNA containing viruses have relatively high rates of spontaneous mutation, i.e., reportedly on the order of 10^-3 to
10^-4 per incorporated nucleotide (Fields & Knipe (1986)). Therefore, since heterogeneity and fluidity of genotype are
inherent in RNA viruses, there are multiple strains/isolates, which may be virulent or avirulent, within the HCV species.
The compositions and methods described herein enable the propagation, identification, detection, and isolation of the
various HCV strains or isolates.
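
To give a feel for what the quoted mutation rates imply, the following back-of-the-envelope sketch multiplies the per-nucleotide rate by a genome length of about 10,000 nucleotides (the approximate HCV genome size given in this description). It assumes independent, uniform errors per incorporated nucleotide, which is a simplification introduced here and not a claim of the source.

```python
def expected_mutations(rate_per_nt: float, genome_length: int) -> float:
    """Expected number of misincorporations in one copy of the genome."""
    return rate_per_nt * genome_length


def prob_error_free_copy(rate_per_nt: float, genome_length: int) -> float:
    """Probability that a single copy carries no misincorporation at all,
    assuming each position is copied independently."""
    return (1.0 - rate_per_nt) ** genome_length


GENOME_LENGTH = 10_000  # approximate size, in nucleotides

for rate in (1e-3, 1e-4):
    print(
        f"rate {rate:g}: about {expected_mutations(rate, GENOME_LENGTH):g} "
        f"changes per copy, P(error-free copy) = "
        f"{prob_error_free_copy(rate, GENOME_LENGTH):.2g}"
    )
```

Under these assumptions each genome copy carries on the order of one to ten changes, which is consistent with the expectation of multiple strains or isolates within the species.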

[0060] Several different strains/isolates of HCV have been identified. (See infra). One such strain or isolate, which
is a prototype, is named CDC/HCV1 (also called HCV1). Information from one strain or isolate, such as a partial genomic
sequence, is sufficient to allow those skilled in the art using standard techniques to isolate new strains/isolates and to
identify whether such new strains/isolates are HCV. For example, several different strains/isolates are described infra.
These strains, which were obtained from a number of human sera (and from different geographical areas), were isolated
utilizing the information from the genomic sequence of HCV1.

[0061] Using the techniques described in E.P.O. Publication No. 318,216 and infra, the genomic structure and the
nucleotide sequence of HCV1 genomic RNA have been deduced. The genome appears to be single-stranded RNA
containing ~10,000 nucleotides. The genome is positive-stranded, and possesses a continuous, translational open
reading frame (ORF) that encodes a polyprotein of about 3,000 amino acids. In the ORF, the structural protein(s) appear
to be encoded in approximately the first quarter of the N-terminus region, with the majority of the polyprotein responsible
for non-structural proteins. When compared with all known viral sequences, small but significant co-linear homologies
are observed with the non-structural proteins of the flavivirus family, and with the pestiviruses (which are now also
considered to be part of the Flavivirus family).
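
The notion of a continuous open reading frame can be made concrete with a short sketch. The function below scans the three forward frames of a cDNA string for the longest stretch running from an ATG to the first in-frame stop codon. It is an illustrative simplification (one strand only, standard stop codons, hypothetical function name and toy sequence) and is not the procedure by which the HCV ORF was deduced.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str) -> str:
    """Return the longest ATG-to-stop reading frame on the given strand.

    RNA input is accepted; U is treated as T. Returns "" if no complete
    ORF (start codon followed by an in-frame stop) is present.
    """
    seq = seq.upper().replace("U", "T")
    best = ""
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the frame
            elif start is not None and codon in STOP_CODONS:
                if i + 3 - start > len(best):
                    best = seq[start:i + 3]
                start = None  # look for the next ORF in this frame
    return best


# Toy sequence (hypothetical). A polyprotein of about 3,000 residues would
# need about 9,000 nt of coding sequence, i.e. most of a ~10,000 nt genome.
toy_orf = longest_orf("CCATGAAATTTGGGTAACCATGTAA")
coding_codons = len(toy_orf) // 3 - 1  # codons excluding the stop
```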

[0062] A schematic alignment of possible regions of a flaviviral polyprotein (using Yellow Fever Virus as an example),
and of a putative polyprotein encoded in the major ORF of the HCV genome, is shown in Fig. 20. In the figure the
possible domains of the HCV polyprotein are indicated. The flavivirus polyprotein contains, from the amino terminus
to the carboxy terminus, the nucleocapsid protein (C), the matrix protein (M), the envelope protein (E), and the
non-structural proteins (NS) 1, 2 (a+b), 3, 4 (a+b), and 5. Based upon the putative amino acids encoded in the nucleotide
sequence of HCV1, a small domain at the extreme N-terminus of the HCV polyprotein appears similar both in size and
high content of basic residues to the nucleocapsid protein (C) found at the N-terminus of flaviviral polyproteins. The
non-structural proteins 2, 3, 4, and 5 (NS2-5) of HCV and of yellow fever virus (YFV) appear to have counterparts of
similar size and hydropathicity, although there is divergence of the amino acid sequences. However, the region of HCV
which would correspond to the regions of YFV polyprotein which contain the M, E, and NS1 proteins not only differs
in sequence, but also appears to be quite different both in size and hydropathicity. Thus, while certain domains of the
HCV genome may be referred to herein as, for example, NS1 or NS2, it should be borne in mind that these designations
are speculative; there may be considerable differences between the HCV family and flaviviruses that have yet to be
appreciated.
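
Comparisons of hydropathicity such as those described above are commonly made with a sliding-window profile. The sketch below assumes the Kyte and Doolittle (1982) scale, which appears in the reference list; this passage does not state which scale or window size was actually used, so both the scale choice and the default window of 9 residues are assumptions made for illustration.

```python
# Kyte-Doolittle hydropathy values per residue (one-letter code).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list:
    """Mean hydropathy over each window of consecutive residues.

    Returns one value per window position; an empty list if the protein
    is shorter than the window.
    """
    scores = [KYTE_DOOLITTLE[residue] for residue in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# Toy peptides (hypothetical): a basic, hydrophilic stretch and a
# hydrophobic stretch give clearly separated profiles.
basic_profile = hydropathy_profile("RKRKRKRKRK", window=5)
hydrophobic_profile = hydropathy_profile("LIVLIVLIVL", window=5)
```

Two polyprotein regions of similar size can then be compared by plotting or correlating their profiles, even where the amino acid sequences themselves have diverged.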

[0063] Different strains, isolates or subtypes of HCV are expected to contain variations at the amino acid and nucleic
acid levels compared with HCV1. Many isolates are expected to show much (i.e., more than about 40%) homology in the
total amino acid sequence compared with HCV1. However, it may also be found that there are other less homologous
HCV isolates. These would be defined as HCV according to various criteria such as, for example, an ORF of
approximately 9,000 nucleotides to approximately 12,000 nucleotides, encoding a polyprotein similar in size to that of HCV1,
an encoded polyprotein of similar hydrophobic and/or antigenic character to that of HCV1, and the presence of
co-linear peptide sequences that are conserved with HCV1. In addition, it is believed that the genome would be a
positive-stranded RNA.

[0064] All HCV isolates encode at least one epitope which is immunologically identifiable (i.e., immunologically
cross-reactive) with an epitope encoded in the HCV cDNAs described herein. Preferably the epitope is contained in an amino
acid sequence described herein and is unique to HCV when compared to previously known pathogens. The uniqueness
of the epitope may be determined by its immunological reactivity with anti-HCV antibodies and lack of immunological
reactivity with antibodies to known pathogens.

[0065] HCV strains and isolates are evolutionarily related. Therefore, it is expected that the overall homology of the
genomes at the nucleotide level may be about 40% or greater, probably will be about 50% or greater, probably about
60% or greater, and even more probably about 80% or greater; and in addition that there will be corresponding
contiguous sequences of at least about 13 nucleotides. It should be noted, as shown infra, that there are variable and
hypervariable regions within the HCV genome; therefore, the homology in these regions is expected to be significantly
less than that in the overall genome. The correspondence between the putative HCV strain genomic sequence and,
for example, the CDC/HCV1 cDNA sequence can be determined by techniques known in the art. For example, they
can be determined by a direct comparison of the sequence information of the polynucleotide from the putative HCV,
and the HCV cDNA sequence(s) described herein. They also can be determined by hybridization of the polynucleotides
under conditions which form stable duplexes between homologous regions (for example, those which would be used
prior to S1 digestion), followed by digestion with single stranded specific nuclease(s), followed by size determination
of the digested fragments.
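
The direct sequence comparison mentioned above can be sketched as follows. The example assumes the two sequences have already been aligned to equal length (gap handling is omitted), and treats "homology" simply as percent identity. The function names and toy sequences are hypothetical. It reports both the overall identity and the longest run of consecutive matching positions, which can be checked against the "at least about 13 nucleotides" criterion.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned positions at which two sequences are identical."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def longest_shared_run(a: str, b: str) -> int:
    """Length of the longest stretch of consecutive identical positions."""
    best = run = 0
    for x, y in zip(a.upper(), b.upper()):
        run = run + 1 if x == y else 0
        best = max(best, run)
    return best


# Toy aligned sequences (hypothetical): two substitutions in 20 positions.
reference = "ACGTACGTACGTACGTACGT"
isolate = "ACGAACGTACGTACGTAGGT"

identity = percent_identity(reference, isolate)
run = longest_shared_run(reference, isolate)
has_13mer = run >= 13
```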

[0066] Because of the evolutionary relationship of the strains or isolates of HCV, putative HCV strains or isolates are
identifiable by their homology at the polypeptide level. Generally, HCV strains or isolates are expected to be at least
40% homologous, more than about 50% homologous, probably more than about 70% homologous, and even more
probably more than about 80% homologous, and some may even be more than about 90% homologous at the
polypeptide level. The techniques for determining amino acid sequence homology are known in the art. For example, the amino
acid sequence may be determined directly and compared to the sequences provided herein. Alternatively the nucleotide
sequence of the genomic material of the putative HCV may be determined (usually via a cDNA intermediate), the
putative amino acid sequence encoded therein can be determined, and the corresponding regions compared.
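
The alternative route described above (determine the cDNA sequence, deduce the encoded amino acids, then compare) can be sketched as below. The example assumes the standard genetic code and an already known reading frame; the function name and the toy cDNAs are hypothetical. It also shows why polypeptide-level comparison can differ from nucleotide-level comparison: a silent substitution changes the cDNA but not the encoded residue.

```python
from itertools import product

# Standard genetic code, built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna: str) -> str:
    """Translate a cDNA in frame 0; trailing partial codons are ignored."""
    cdna = cdna.upper()
    return "".join(
        CODON_TABLE[cdna[i:i + 3]] for i in range(0, len(cdna) - 2, 3)
    )


# Toy cDNAs (hypothetical) differing by one silent change (AGC -> AGT).
cdna_a = "ATGAGCACGAATCCTAAACCT"
cdna_b = "ATGAGTACGAATCCTAAACCT"

protein_a = translate(cdna_a)
protein_b = translate(cdna_b)
nucleotide_differences = sum(x != y for x, y in zip(cdna_a, cdna_b))
same_polypeptide = protein_a == protein_b
```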
[0067] As used herein, a polynucleotide "derived from" a designated sequence refers to a polynucleotide sequence
which is comprised of a sequence of at least about 6 nucleotides, preferably at least about 8 nucleotides,
more preferably at least about 10-12 nucleotides, and even more preferably at least about 15-20 nucleotides
corresponding to a region of the designated nucleotide sequence. "Corresponding" means homologous to or complementary
to the designated sequence. Preferably, the sequence of the region from which the polynucleotide is derived is
homologous to or complementary to a sequence which is unique to an HCV genome. More preferably, the derived sequence
is homologous or complementary to a sequence that is unique to all or to a majority of HCV isolates. Whether or not
a sequence is unique to the HCV genome can be determined by techniques known to those of skill in the art. For
example, the sequence can be compared to sequences in databanks, e.g., GenBank, to determine whether it is
present in the uninfected host or other organisms. The sequence can also be compared to the known sequences of
other viral agents, including those which are known to induce hepatitis, e.g., HAV, HBV, and HDV, and to members of
the Flaviviridae. The correspondence or non-correspondence of the derived sequence to other sequences can also
be determined by hybridization under the appropriate stringency conditions. Hybridization techniques for determining
the complementarity of nucleic acid sequences are known in the art, and are discussed infra. See also, for example,
Maniatis et al. (1982). In addition, mismatches of duplex polynucleotides formed by hybridization can be determined
by known techniques, including for example, digestion with a nuclease such as S1 that specifically digests
single-stranded areas in duplex polynucleotides. Regions from which typical DNA sequences may be "derived" include but
are not limited to, for example, regions encoding specific epitopes, as well as non-transcribed and/or non-translated
regions.
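
The databank comparison described above amounts to asking whether a candidate oligomer, or its complementary strand, occurs anywhere in a set of non-HCV sequences. The sketch below assumes exact matching and uses a small hypothetical dictionary in place of a real databank; a real search would also have to tolerate mismatches and consider hybridization stringency.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def is_unique_to_target(oligo: str, background: dict) -> bool:
    """True if neither the oligo nor its reverse complement occurs
    (as an exact substring) in any background sequence."""
    oligo = oligo.upper()
    rc = reverse_complement(oligo)
    return not any(
        oligo in seq.upper() or rc in seq.upper()
        for seq in background.values()
    )


# Hypothetical stand-ins for host and other-virus databank entries.
background = {
    "host_fragment": "ACGTACGTTTGACCAGTAGGCATCGA",
    "other_virus_fragment": "TTGGCCAATCGGATTACAGGCTTAA",
}

shared_sense = is_unique_to_target("GATTACAGGC", background)      # present as written
shared_antisense = is_unique_to_target("GGTCAAACGT", background)  # complement present
novel = is_unique_to_target("CCCCCCGGGAAA", background)           # absent on both strands
```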

[0068] The derived polynucleotide is not necessarily physically derived from the nucleotide sequence shown, but
may be generated in any manner, including for example, chemical synthesis or DNA replication or reverse transcription
or transcription. In addition, combinations of regions corresponding to that of the designated sequence may be modified
in ways known in the art to be consistent with an intended use.

[0069] The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA,
semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all or a portion of a
polynucleotide with which it is associated in nature, (2) is linked to a polynucleotide other than that to which it is linked
in nature, or (3) does not occur in nature.

[0070] The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either
ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. Thus, this term
includes double- and single-stranded DNA and RNA. It also includes known types of modifications, for example, labels
which are known in the art, methylation, "caps", substitution of one or more of the naturally occurring nucleotides with
an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl
phosphonates, phosphotriesters, phosphoamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates,
phosphorodithioates, etc.), those containing pendant moieties, such as, for example, proteins (including, e.g.,
nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.),
those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators,
those with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the
polynucleotide.

[0071] As used herein, the "sense strand" of a nucleic acid contains the sequence that has sequence homology to
that of mRNA. The "anti-sense strand" contains a sequence which is complementary to that of the "sense strand". 
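The relationship between the two strands can be illustrated with a minimal Python sketch. The sequence below is an arbitrary illustrative string rather than an HCV sequence, and the function name is hypothetical.

```python
# Illustrative sketch only: derive the anti-sense strand from a sense strand.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_strand(sense: str) -> str:
    """Return the reverse complement of a DNA sense strand, written 5' to 3'."""
    return sense.upper().translate(COMPLEMENT)[::-1]


sense = "ATGGCC"                      # arbitrary example, not taken from Fig. 18
antisense = antisense_strand(sense)
print(antisense)                      # GGCCAT
```

Applying the function twice returns the original sense strand, which reflects the mutual complementarity of the two strands.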
[0072] As used herein, a "positive stranded genome" of a virus is one in which the genome, whether RNA or DNA,
is single-stranded and which encodes a viral polypeptide(s). Examples of positive stranded RNA viruses include
Togaviridae, Coronaviridae, Retroviridae, Picornaviridae, and Caliciviridae. Included also are the Flaviviridae, which were
formerly classified as Togaviridae. See Fields & Knipe (1986).

[0073] The term "primer" as used herein refers to an oligomer which is capable of acting as a point of initiation of
synthesis of a polynucleotide strand when placed under appropriate conditions. The primer will be completely or
substantially complementary to a region of the polynucleotide strand to be copied. Thus, under conditions conducive to
hybridization, the primer will anneal to the complementary region of the analyte strand. Upon addition of suitable
reactants (e.g., a polymerase, nucleotide triphosphates, and the like), the primer is extended by the polymerizing agent
to form a copy of the analyte strand. The primer may be single-stranded, or alternatively may be partially or fully
double-stranded.

[0074] The terms "analyte polynucleotide" and "analyte strand" refer to a single- or double-stranded nucleic acid 
molecule which is suspected of containing a target sequence, and which may be present in a biological sample. 

[0075] As used herein, the term "oligomer" refers to primers and to probes. The term oligomer does not connote the
size of the molecule. However, typically oligomers are no greater than 1000 nucleotides, more typically are no greater 
than 500 nucleotides, even more typically are no greater than 250 nucleotides; they may be no greater than 100 
nucleotides, and may be no greater than 75 nucleotides, and also may be no greater than 50 nucleotides in length. 
[0076] As used herein, the term "probe" refers to a structure comprised of a polynucleotide which forms a hybrid
structure with a target sequence, due to complementarity of at least one sequence in the probe with a sequence in the
target region. The polynucleotide regions of probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide
analogs. Included within probes are "capture probes" and "label probes". Preferably the probe does not contain a
sequence complementary to sequence(s) used to prime the polymerase chain reaction (PCR).
[0077] As used herein, the term "target region" refers to a region of the nucleic acid which is to be amplified and/or
detected. The term "target sequence" refers to a sequence with which a probe or primer will form a stable hybrid under
desired conditions.

[0078] The term "targeting polynucleotide sequence" as used herein, refers to a polynucleotide sequence which is 
comprised of nucleotides which are complementary to a target nucleotide sequence; the sequence is of sufficient length 
and complementarity with the target sequence to form a duplex which has sufficient stability for the purpose intended. 

[0079] The term "binding partner" as used herein refers to a molecule capable of binding a ligand molecule with high
specificity, as for example an antigen and an antibody specific therefor. In general, the specific binding partners must
bind with sufficient affinity to immobilize the analyte copy/complementary strand duplex under the isolation conditions.
Specific binding partners are known in the art, and include, for example, biotin and avidin or streptavidin, IgG and
protein A, the numerous known receptor-ligand couples, and complementary polynucleotide strands. In the case of
complementary polynucleotide binding partners, the partners are normally at least about 15 bases in length, and may
be at least 40 bases in length; in addition, they have a content of Gs and Cs of at least about 40% and as much as
about 60%. The polynucleotides may be composed of DNA, RNA, or synthetic nucleotide analogs.
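The length and G+C guidelines for complementary polynucleotide binding partners can be checked with a short Python sketch. The function names, the default thresholds expressed as fractions, and the example sequence are illustrative assumptions; the text gives only approximate ("about") figures, so the hard cut-offs here are a simplification.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of bases in `seq` that are G or C."""
    if not seq:
        raise ValueError("sequence must not be empty")
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def fits_binding_partner_guidelines(seq: str, min_len: int = 15,
                                    gc_min: float = 0.40,
                                    gc_max: float = 0.60) -> bool:
    """Check 'at least about 15 bases' and 'about 40% to about 60% G+C'."""
    return len(seq) >= min_len and gc_min <= gc_fraction(seq) <= gc_max


candidate = "ATGCATGCATGCATGC"   # arbitrary 16-mer, not an HCV sequence
print(len(candidate), gc_fraction(candidate),
      fits_binding_partner_guidelines(candidate))
```

The same check applies to RNA sequences, since only G and C are counted.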
[0080] The term "coupled" as used herein refers to attachment by covalent bonds or by strong non-covalent
interactions (e.g., hydrophobic interactions, hydrogen bonds, etc.). Covalent bonds may be, for example, ester, ether,
phosphoester, amide, peptide, imide, carbon-sulfur bonds, carbon-phosphorus bonds, and the like.

[0081] The term "support" refers to any solid or semisolid surface to which a desired binding partner may be anchored. 
Suitable supports include glass, plastic, metal, polymer gels, and the like, and may take the form of beads, wells, 
dipsticks, membranes, and the like.

[0082] The term "label" as used herein refers to any atom or moiety which can be used to provide a detectable 
(preferably quantifiable) signal, and which can be attached to a polynucleotide or polypeptide.

[0083] The term "multimer", as used herein, refers to linear or branched polymers of the same repeating
single-stranded polynucleotide unit or different single-stranded polynucleotide units. At least one of the units has a sequence,
length, and composition that permits it to hybridize specifically to a first single-stranded nucleotide sequence of interest,
typically an analyte or an oligomer bound to an analyte. In order to achieve such specificity and stability, this unit will
normally be at least about 15 nucleotides in length, typically no more than about 50 nucleotides in length, and preferably
about 30 nucleotides in length; moreover, the content of Gs and Cs will normally be at least about 40%, and at most
about 60%. In addition to such unit(s), the multimer includes a multiplicity of units that are capable of hybridizing
specifically and stably to a second single-stranded nucleotide of interest, typically a labeled polynucleotide or another
multimer. These units are generally about the same size and composition as the multimers discussed above. When a
multimer is designed to be hybridized to another multimer, the first and second oligonucleotide units are heterogeneous
(different), and do not hybridize with each other under the conditions of the selected assay. Thus, multimers may be
ligands which couple the label to the probe.

[0084] As used herein, the term "viral RNA", which includes HCV RNA, refers to RNA from the viral genome,
fragments thereof, transcripts thereof, and mutant sequences derived therefrom.

[0085] As used herein, a "biological sample" refers to a sample of tissue or fluid isolated from an individual, including 
but not limited to, for example, plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respiratory, 
intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and also samples of in vitro cell 
culture constituents (including but not limited to conditioned medium resulting from the growth of cells in cell culture
medium, putatively virally infected cells, recombinant cells, and cell components). 

Description of the Invention 

[0086] The practice of the present invention will employ, unless otherwise indicated, conventional techniques of
chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art.
Such techniques are explained fully in the literature. See e.g., Maniatis, Fritsch & Sambrook, MOLECULAR CLONING:
A LABORATORY MANUAL (1982); DNA CLONING, VOLUMES I AND II (D.N. Glover ed. 1985); OLIGONUCLEOTIDE
SYNTHESIS (M.J. Gait ed. 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds. 1984); the
series, METHODS IN ENZYMOLOGY (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman,
and Wu, eds., respectively). All patents, patent applications, and publications mentioned herein, both supra and infra,
are hereby incorporated herein by reference.

[0087] The useful materials and processes of the present invention are made possible by the identification of HCV 
as the etiologic agent of BB-NANBV, and by the provision of a family of nucleotide sequences isolated from cDNA 
libraries which contain HCV cDNA sequences. These cDNA libraries were derived from nucleic acid sequences present
in the plasma of an HCV-infected chimpanzee. The construction of one of these libraries, the "c" library (ATCC No. 
40394), is described in E.P.O. Publication No. 318,216. 

[0088] Utilizing the above-described HCV cDNA sequences, as well as those described herein, oligomers can be
constructed which are useful as reagents for detecting viral polynucleotides in biological samples. The novel oligomers
described herein enable further characterization of the HCV genome. Polynucleotide primers derived from these
sequences may be used to amplify sequences present in cDNA libraries, and/or to screen cDNA libraries for additional
overlapping cDNA sequences, which, in turn, may be used to obtain more overlapping sequences. As indicated infra, 
and in E.P.O. Publication No. 318,216, the genome of HCV appears to be RNA comprised primarily of a large open 
reading frame (ORF) which encodes a large polyprotein. 
[0089] In addition to the above, the information provided infra allows the identification of additional HCV strains or
isolates. The isolation and characterization of the additional HCV strains or isolates may be accomplished utilizing
techniques known to those of skill in the art, for example, by isolating the nucleic acids from body components which
contain viral particles and/or viral RNA, creating cDNA libraries using the oligomers described infra, for screening the
libraries for clones containing HCV cDNA sequences described infra, and comparing the HCV cDNAs from the new
isolates with the cDNAs described in E.P.O. Publication No. 318,216 and infra. Strains or isolates which fit within the
parameters of HCV, as described in the Definitions section, supra, are readily identifiable. Other methods for identifying
HCV strains will be obvious to those of skill in the art, based upon the information provided herein.

Isolation of the HCV cDNA Sequences 

[0090] The oligomers of the invention contain regions which form hybrid duplex structures with targeted sequences 
in HCV polynucleotides. The HCV polynucleotide hybridizing regions of the oligomers may be ascertained from the 
HCV cDNA sequence(s) provided herein, and described in E.P.O. Publication No. 318,216. A composite of HCV cDNA 
from HCV1, a prototypic HCV, is shown in Fig. 18. The composite sequence is based upon sequence information 
derived from a number of HCV cDNA clones, which were isolated from a number of HCV cDNA libraries, including the 
"c" library present in lambda gt11 (ATCC No. 40394), and from human serum. The HCV cDNA clones were isolated 
by methods described in E.P.O. Publication No. 318,216. Briefly, the majority of clones which were isolated contained 
sequences from the HCV cDNA "c" library which was constructed using pooled serum from a chimpanzee with chronic
HCV infection and containing a high titer of the virus, i.e., at least 10^6 chimp infectious doses/ml (CID/ml). The pooled
serum was used to isolate viral particles; nucleic acids isolated from these particles were used as the template in the
construction of cDNA libraries to the viral genome. The initial clone, 5-1-1, was obtained by screening the "c" library
with serum from infected individuals. After the isolation of the initial clone, the remainder of the sequence was obtained
by screening with synthetic polynucleotide probes, the sequences of which were derived from the 5'-region and the
3'-region of the known HCV cDNA sequence(s).

[0091] The description of the methods to retrieve the cDNA sequences is mostly of historical interest. The resultant 
sequences (and their complements) are provided herein, and the sequences, or any portion thereof, could be prepared 
using synthetic methods, or by a combination of synthetic methods with retrieval of partial sequences using methods 
similar to those described in E.P.O. Publication No. 318,216.

Oligomer Primers 

[0092] Using as a basis the HCV genome (as illustrated in Fig. 18), and/or preferably conserved regions of the HCV
genome, oligomers of approximately 8 nucleotides or more can be prepared which hybridize with the positive strand(s)
of HCV RNA or its complement, as well as to HCV cDNAs. These oligomers can serve as primers for the transcription
and/or replication of targeted HCV sequences. The oligomers contain a targeting polynucleotide sequence, which is
comprised of nucleotides which are complementary to a target HCV nucleotide sequence; the sequence is of sufficient
length and complementarity with the HCV sequence to form a duplex which has sufficient stability for the purpose
intended. For example, if the purpose is the isolation, via immobilization, of an analyte containing a target HCV
sequence, the oligomers would contain a polynucleotide region which is of sufficient length and complementarity to the
targeted HCV sequence to afford sufficient duplex stability to immobilize the analyte on a solid surface, via its binding
to the oligomers, under the isolation conditions. As a further example, if the oligomers are to serve as primers for the
transcription and/or replication of target HCV sequences in an analyte polynucleotide, the oligomers would contain a
polynucleotide region of sufficient length and complementarity to the targeted HCV sequence to allow the polymerizing
agent to continue replication from the primers which are in stable duplex form with the target sequence, under the
polymerizing conditions.
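The text does not state how duplex stability is to be estimated. As a rough illustration only, the sketch below uses the Wallace rule of thumb, which is an assumption introduced here and not part of this description: about 2 °C per A/T (or A/U) pair plus about 4 °C per G/C pair, a rule commonly applied to short oligomers of roughly 14 to 20 nucleotides. The function name and the example sequence are hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Approximate melting temperature (deg C) of a short oligomer duplex.

    Wallace rule of thumb: 2 * (A + T/U) + 4 * (G + C).
    It ignores salt, concentration, and mismatches.
    """
    oligo = oligo.upper()
    at = sum(oligo.count(base) for base in "ATU")
    gc = sum(oligo.count(base) for base in "GC")
    return 2 * at + 4 * gc


example = "ATGCATGCATGCAT"        # arbitrary 14-mer, not an HCV sequence
print(wallace_tm(example))        # 8 A/T and 6 G/C bases give 40
```

An estimate of this kind only ranks candidate oligomers; whether a duplex is stable enough "for the purpose intended" still depends on the actual isolation or polymerizing conditions.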

[0093] The oligomers may contain a minimum of about 4 contiguous nucleotides which are complementary to targeted
HCV sequence; usually the oligomers will contain a minimum of about 8 contiguous nucleotides which are
complementary to the targeted HCV sequence, and preferably will contain a minimum of about 14 contiguous nucleotides
which are complementary to the targeted HCV sequence.
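The length of the contiguous complementary stretch between an oligomer and a target can be measured as sketched below in Python. This is a minimal illustration under stated assumptions: both sequences are DNA written 5' to 3', only perfect Watson-Crick pairing is counted, the function names are hypothetical, and the target string is arbitrary rather than taken from Fig. 18.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_complementary_run(oligo: str, target: str) -> int:
    """Longest contiguous stretch of `oligo` complementary to `target`.

    Computed as the longest common substring between the reverse
    complement of the oligomer and the target (dynamic programming).
    """
    probe = reverse_complement(oligo)
    target = target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(probe) + 1):
        curr = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if probe[i - 1] == target[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def complementarity_tier(run: int) -> str:
    """Map a run length onto the approximate minimums named in the text."""
    if run >= 14:
        return "preferred (about 14 or more)"
    if run >= 8:
        return "usual (about 8 or more)"
    if run >= 4:
        return "minimum (about 4 or more)"
    return "below the stated minimum"


target = "TTGACCGTAGGCTAACGTTCAGGATCCGTA"      # arbitrary illustrative target
oligo = reverse_complement(target[6:20])       # 14-mer complementary to the target
run = longest_complementary_run(oligo, target)
print(run, complementarity_tier(run))
```

Because the thresholds in the text are stated as "about" 4, 8, and 14, the tier boundaries in `complementarity_tier` should be read as approximate.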

[0094] Suitable HCV nucleotide targeting sequences may be comprised of nucleotides which are complementary
to nucleotides selected from the following HCV cDNA nucleotides, which are shown in Fig. 18 (nn x - nn y denotes from
about nucleotide number x to about nucleotide number y):

nn -340 - nn -330; nn -330 - nn -320; nn -320 - nn -310;
nn -310 - nn -300; nn -300 - nn -290; nn -290 - nn -280;
nn -280 - nn -270; nn -270 - nn -260; nn -260 - nn -250;
nn -250 - nn -240; nn -240 - nn -230; nn -230 - nn -220;
nn -220 - nn -210; nn -210 - nn -200; nn -200 - nn -190;
nn -190 - nn -180; nn -180 - nn -170; nn -170 - nn -160;
nn -160 - nn -150; nn -150 - nn -140; nn -140 - nn -130;
nn -130 - nn -120; nn -120 - nn -110; nn -110 - nn -100;
nn -100 - nn -90; nn -90 - nn -80; nn -80 - nn -70;
nn -70 - nn -60; nn -60 - nn -50; nn -50 - nn -40;
nn -40 - nn -30; nn -30 - nn -20; nn -20 - nn -10;
nn -10 - nn 1; nn 1 - nn 10; nn 10 - nn 20; nn 20 - nn 30;
nn 30 - nn 40; nn 40 - nn 50; nn 50 - nn 60; nn 60 - nn 70;
nn 70 - nn 80; nn 80 - nn 90; nn 90 - nn 100; nn 100 - nn 110;
nn 110 - nn 120; nn 120 - nn 130; nn 130 - nn 140;
nn 140 - nn 150; nn 150 - nn 160; nn 160 - nn 170;
nn 170 - nn 180; nn 180 - nn 190; nn 190 - nn 200;
nn 200 - nn 210; nn 210 - nn 220; nn 220 - nn 230;
nn 230 - nn 240; nn 240 - nn 250; nn 250 - nn 260;
nn 260 - nn 270; nn 270 - nn 280; nn 280 - nn 290;
nn 290 - nn 300; nn 300 - nn 310; nn 310 - nn 320;
nn 320 - nn 330; nn 330 - nn 340; nn 340 - nn 350;
nn 350 - nn 360; nn 360 - nn 370; nn 370 - nn 380;
and so on, in consecutive ten-nucleotide intervals, through
nn 9040 - nn 9050; nn 9050 - nn 9060.







EP 0 398 748 B1 


[0095] The oligomer, however, need not consist only of the sequence which is complementary to the targeted HCV 
sequence. It may contain in addition, nucleotide sequences or other moieties which are suitable for the purposes for 
which the oligomers are used. For example, if the oligomers are used as primers for the amplification of HCV sequences 
via PCR, they may contain sequences which, when in duplex, form restriction enzyme sites which facilitate the cloning
of the amplified sequences.
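A primer carrying such an added sequence can be sketched in Python as follows. The choice of EcoRI (recognition sequence GAATTC), the two-base 5' clamp, the function name, and the targeting sequence are all illustrative assumptions; the text does not name a particular enzyme or primer.

```python
ECORI_SITE = "GAATTC"   # EcoRI recognition sequence (illustrative choice of enzyme)


def add_restriction_tail(targeting_seq: str, site: str = ECORI_SITE,
                         clamp: str = "GC") -> str:
    """Return clamp + restriction site + targeting sequence, written 5' to 3'.

    Only the targeting portion is meant to hybridize to the target; the
    5' tail supplies the restriction site once the product is in duplex.
    """
    return clamp + site + targeting_seq.upper()


targeting = "acggtcaatgcctagt"          # hypothetical targeting sequence
primer = add_restriction_tail(targeting)
print(primer)                           # GCGAATTCACGGTCAATGCCTAGT
```

Placing the site at the 5' end leaves the 3' end of the primer fully complementary to the target, which is where extension by the polymerizing agent begins.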

[0096] The preparation of the oligomers is by means known in the art, including, for example, by methods which 
include excision, transcription, or chemical synthesis. The target sequences and/or regions of the genome which are 
selected to which the targeting polynucleotides of the oligomers are complementary depend upon the purpose. For 
example, if the goal is to screen for the presence of HCV in biological samples (e.g. blood), the preferred oligomers
would be used as primers, and would hybridize to conserved regions of the HCV genome. Some of the conserved
regions of the HCV genome to which the oligomers may bind are described herein, for example, the regions which
include nucleotide numbers from about the 5'-terminus to about 200, or from about 4000 to about 5000, or from about
8000 to about 9040 as shown in Fig. 18, or preferably nucleotides -318 to 174, 4056 to 4448, and 4378 to 4902. Other
regions of the genome which are conserved are readily ascertainable by comparison of the nucleotide sequences of
various isolates of HCV, including the prototype HCV, HCV1. Methods for conducting comparisons between genotypes
to determine conserved and nonconserved regions are known in the art, and examples of these methods are disclosed 
herein. 
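
By way of illustration only, the comparison of isolates described above can be sketched as a sliding-window identity scan over sequences that have already been aligned. The function name, the window size and the toy sequences are assumptions introduced for this sketch; they are not HCV data and not a method disclosed herein.

```python
def conserved_windows(aligned, window=20, min_identity=1.0):
    """Return (start, end) spans, 0-based and end-exclusive, in which
    every window of the given size meets the identity threshold."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be pre-aligned to equal length")
    # A column counts as identical when all isolates carry the same base.
    identical = [len({seq[i] for seq in aligned}) == 1 for i in range(length)]
    spans = []
    for start in range(length - window + 1):
        fraction = sum(identical[start:start + window]) / window
        if fraction < min_identity:
            continue
        end = start + window
        # Merge windows that touch or overlap the previous span.
        if spans and start <= spans[-1][1]:
            spans[-1] = (spans[-1][0], max(spans[-1][1], end))
        else:
            spans.append((start, end))
    return spans


# Hypothetical aligned isolates: identical in the first 8 columns only.
isolates = ["ACGTACGTAAAA", "ACGTACGTTTTT"]
print(conserved_windows(isolates, window=4))
```

Spans found this way are only candidates; primer regions would still need to be checked against further isolates.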

[0097] For use of such probes as agents to detect the presence of HCV polynucleotides (for example in screening
for contaminated blood), the biological sample to be analyzed, such as blood or serum, may be treated, if desired, to
extract the nucleic acids contained therein. The resulting nucleic acid from the sample may be subjected to gel
electrophoresis or other size separation techniques; alternatively, the nucleic acid sample may be dot blotted without size
separation. In order to form hybrid duplexes with the targeting sequence of the probe, the targeted region of the analyte
nucleic acid must be in single stranded form. Where the sequence is naturally present in single stranded form,
denaturation will not be required. However, where the sequence is present in double stranded form, the sequence will be
denatured. Denaturation can be carried out by various techniques known in the art. Subsequent to denaturation, the
analyte nucleic acid and probe are incubated under conditions which promote stable hybrid formation of the target
sequence in the probe with the putative targeted sequence in the analyte, and the resulting duplexes containing the
probe(s) are detected.

[0098] Detection of the resulting duplex, if any, is usually accomplished by the use of labeled probes; alternatively,
the probe may be unlabeled, but may be detectable by specific binding with a ligand which is labeled, either directly
or indirectly. Suitable labels, and methods for labeling probes and ligands are known in the art, and include, for example, 
radioactive labels which may be incorporated by known methods (e.g., nick translation or kinasing), biotin, fluorescent 
groups, chemiluminescent groups (e.g., dioxetanes, particularly triggered dioxetanes), enzymes, antibodies, and the 
like. 

[0099] The region of the probes which is used to bind to the analyte can be made completely complementary to
the HCV genome. Therefore, usually high stringency conditions are desirable in order to prevent false positives.
However, conditions of high stringency should only be used if the probes are complementary to regions of the viral genome
which lack heterogeneity. The stringency of hybridization is determined by a number of factors during hybridization
and during the washing procedure, including temperature, ionic strength, length of time, and concentration of
formamide. These factors are outlined in, for example, Maniatis, T. (1982).

[0100] Variations of this basic scheme which are known in the art, including those which facilitate separation of the
duplexes to be detected from extraneous materials and/or which amplify the signal from the labeled moiety, may also
be used. A number of these variations are reviewed in, for example: Matthews and Kricka (1988), Analytical
Biochemistry 169:1; Landegren et al. (1988), Science 242:229; and Mittlin (1989), Clinical Chem. 35:1819. These and the
following publications describing assay formats are hereby incorporated by reference herein. Probes suitable for detecting
HCV in these assays are comprised of sequences which hybridize with target HCV polynucleotide sequences to form
duplexes with the analyte strand, wherein the duplexes are of sufficient stability for detection in the specified assay
system.

Detection of HCV RNA and Polynucleotides Derived Therefrom Using an HCV/cPCR Method

[0101] A particularly useful method for detecting HCV RNA or polynucleotides derived from HCV RNA is the
HCV/cPCR method, which is a subject of the herein application, and which utilizes the polymerase chain reaction technique
(PCR) which is described by Saiki et al. (1986), by Mullis in U.S. Patent No. 4,683,195, and by Mullis et al. in U.S. Patent
No. 4,683,202. The HCV/cPCR method utilizes primers and probes derived from the information provided herein
concerning the nature of the HCV genome.

[0102] Generally, in the PCR technique, short oligonucleotide primers are prepared which match opposite ends of
a desired sequence. The sequence between the primers need not be known. A sample of polynucleotide is extracted
and denatured, preferably by heat, and hybridized with oligonucleotide primers which are present in molar excess.
Polymerization is catalyzed by a template- and primer-dependent polymerase in the presence of deoxynucleotide
triphosphates or nucleotide analogs (dNTPs). This results in two "long products" which contain the respective primers
at their 5'-termini, covalently linked to the newly synthesized complements of the original strands. The replicated DNA
is again denatured, hybridized with oligonucleotide primers, returned to polymerizing conditions, and a second cycle
of replication is initiated. The second cycle provides the two original strands, the two long products from cycle 1, and
two "short products" replicated from the long products. The short products contain sequences (sense or antisense)
derived from the target sequence, flanked at the 5'- and 3'-termini with primer sequences. With each additional cycle,
the number of short products increases exponentially. Thus, this process causes the amplification of a specific target
sequence.
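
As a rough illustration of the cycle arithmetic just described, the following sketch counts strands under the idealized assumption that every strand is copied once per cycle, starting from a single duplex. The function name is hypothetical. The model also counts the long products that are newly copied from the original strands in each later cycle, so long products grow linearly while short products grow approximately exponentially.

```python
def pcr_strand_counts(cycles):
    """Idealized strand counts after a number of PCR cycles, starting
    from one duplex (two original strands) and assuming 100% efficiency."""
    originals, long_products, short_products = 2, 0, 0
    for _ in range(cycles):
        # Each original strand templates one new long product per cycle.
        new_long = originals
        # Long and short products both template new short products.
        new_short = long_products + short_products
        long_products += new_long
        short_products += new_short
    return originals, long_products, short_products


for n in (1, 2, 3, 10):
    print(n, pcr_strand_counts(n))
```

Under these assumptions the short-product count after n cycles equals 2^(n+1) - 2n - 2, which the simulation reproduces.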

[0103] In the method, a sample is provided which is suspected of containing HCV RNA, or a fragment thereof. The 
sample is usually taken from an individual suspected of having NANBH; however, other sources of the sample are 
included, e.g., conditioned medium or cells from in vitro systems in which the virus has been replicated. The sample, 
however, must contain the target nucleic acid sequence(s). 

[0104] The sample is then subjected to conditions which allow reverse transcription of HCV RNA into HCV cDNA. 
Conditions for reverse transcribing RNA are known to those of skill in the art, and are described in, for example, Maniatis 
et al. (1982), and in Methods in Enzymology. A preferred method of reverse transcription utilizes reverse transcriptase 
from a variety of sources, including recombinant molecules, and isolated from, for example, a retrovirus, preferably 
from avian myeloblastosis virus (AMV), and suitable conditions for the transcription. The HCV cDNA product of reverse
transcription is in an RNA:DNA hybrid, which results from the first round of reverse transcription; subsequently,
DNA:DNA hybrids result from two or more rounds of transcription.

[0105] The HCV cDNA resulting from reverse transcription is then subjected to PCR to amplify the target sequence.
In order to accomplish this, the HCV cDNA is denatured, and the separated strands are hybridized with primers which
flank the target sequence. 

[0106] Strand separation may be accomplished by any suitable denaturing method, including physical, chemical, or 
enzymatic means, which are known to those of skill in the art. A preferred method, which is physical, involves heating 
the nucleic acid until it is completely (>99%) denatured. Typical heat denaturation involves temperatures ranging from 
about 80°C to about 105°C, for times ranging from about 1 to 10 minutes. 

[0107] After hybridization of the HCV cDNA with the primers, the target HCV sequences are replicated by a
polymerizing means which utilizes a primer oligonucleotide to initiate the synthesis of the replicate chain. The primers are
selected so that they are complementary to sequences of the HCV genome. Oligomeric primers which are
complementary to regions of the sense and antisense strands of HCV cDNA can be designed from the composite HCV cDNA
sequence provided in Fig. 18.

[0108] The primers are selected so that their relative positions along a duplex sequence are such that an extension 
product synthesized from one primer, when it is separated from its template (complement), serves as a template for 
the extension of the other primer to yield a replicate chain of defined length. 
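
The positional relationship between the two primers can be sketched as follows. This is a minimal illustration assuming exact matches and a toy sense-strand sequence; the function names and the sequence are hypothetical and are not taken from Fig. 18.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primer_pair(sense, start, end, primer_len):
    """Pick primers flanking sense[start:end] (0-based, end-exclusive).
    The forward primer copies the sense strand; the reverse primer is the
    reverse complement of the far end, so each extends toward the other."""
    forward = sense[start:start + primer_len]
    reverse = reverse_complement(sense[end - primer_len:end])
    return forward, reverse


def amplicon(sense, forward, reverse):
    """Return the sense-strand product of defined length, or None."""
    left = sense.find(forward)
    right = sense.find(reverse_complement(reverse))
    if left < 0 or right < left:
        return None
    return sense[left:right + len(reverse)]


# Hypothetical 24-nt template; the target spans positions 4 to 20.
template = "AAAACCCCGGGGTTTTAGGCATCA"
fwd, rev = design_primer_pair(template, 4, 20, 4)
print(fwd, rev, amplicon(template, fwd, rev))
```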

[0109] The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be 
double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare 
extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime 
the synthesis of extension products in the presence of the agent for polymerization. The exact lengths of the primers 
will depend on many factors, including temperature and source of the primer and use of the method. For example, 
depending on the complexity of the target sequence, the oligonucleotide primer typically contains about 15-45 nucle- 
otides, although it may contain more or fewer nucleotides. Short primer molecules generally require cooler temperatures 
to form sufficiently stable hybrid complexes with the template. 
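
The dependence of annealing temperature on primer length and composition can be illustrated with the widely used "2 + 4" rule of thumb for short oligonucleotides, which assigns about 2°C per A or T and 4°C per G or C. This rule is not part of the present disclosure and is only a coarse approximation, least reliable for primers longer than about 20 nucleotides; the function name is hypothetical. The 15-mer used as input is the reverse transcriptase primer given in the Examples.

```python
def wallace_tm(primer):
    """Approximate melting temperature in degrees C using the rule of thumb
    Tm = 2 * (A + T) + 4 * (G + C). Spaces in the input are ignored."""
    seq = primer.replace(" ", "").upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("primer contains characters other than A, C, G, T")
    return 2 * at + 4 * gc


print(wallace_tm("GGT GAC GTG GGT TTC"))  # 6 A/T and 9 G/C give 48
print(wallace_tm("GGT GAC GTG"))          # a shorter primer gives a lower estimate
```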

[0110] The primers used herein are selected to be "substantially" complementary to the different strands of each
specific sequence to be amplified. Therefore, the primers need not reflect the exact sequence of the template, but must
be sufficiently complementary to selectively hybridize with their respective strands. For example, a non-complementary
nucleotide fragment may be attached to the 5'-end of the primer, with the remainder of the primer sequence being
complementary to the strand. Alternatively, non-complementary bases or longer sequences can be interspersed into
the primer, provided that the primer has sufficient complementarity with the sequence of one of the strands to be
amplified to hybridize therewith, and to thereby form a duplex structure which can be extended by the polymerizing 
means. The non-complementary nucleotide sequences of the primers may include restriction enzyme sites. Appending 
a restriction enzyme site to the end(s) of the target sequence would be particularly helpful for cloning of the target 
sequence. 
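
A minimal sketch of a primer carrying a non-complementary 5'-tail follows. The EcoRI recognition sequence GAATTC is used purely as an example of a restriction site, and the two-base clamp, the function names and the toy target are assumptions made for illustration. The check confirms that the 3'-portion of the tailed primer is unchanged, since that is the portion which must base pair for extension.

```python
def add_5prime_tail(primer, site, clamp=""):
    """Prepend an optional clamp and a restriction site to the 5'-end."""
    return clamp + site + primer


def matched_3prime_length(tailed_primer, target_copy):
    """Count identical bases walking back from the 3'-ends of the primer
    and of the target strand segment the primer is meant to copy."""
    count = 0
    for a, b in zip(reversed(tailed_primer), reversed(target_copy)):
        if a != b:
            break
        count += 1
    return count


primer = "GGTGACGTGGGTTTC"       # 15-mer from the Examples, spaces removed
tailed = add_5prime_tail(primer, "GAATTC", clamp="GC")
target = "AAAA" + primer         # hypothetical target segment
print(tailed, matched_3prime_length(tailed, target))
```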

[0111] It will be understood that "primer", as used herein, may refer to more than one primer, particularly in the case 
where there is some ambiguity in the information regarding the terminal sequence(s) of the target region to be amplified. 
Hence, a "primer" includes a collection of primer oligonucleotides containing sequences representing the possible 
variations in the sequence or includes nucleotides which allow a typical basepairing. One of the primer oligonucleotides
in this collection will be homologous with the end of the target sequence. A specific case is shown in the Examples,
where oligomer sets of 44-mers and 45-mers were utilized to prime the amplification of a potentially variant region of 
the HCV genome. 
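
A "primer" in this collective sense can be illustrated by expanding a degenerate specification into its member sequences. The sketch below uses the standard IUPAC nucleotide ambiguity codes; the function name and the example input are hypothetical, and the example is not one of the 44-mer or 45-mer sets of the Examples.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer):
    """List every unambiguous sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


members = expand_degenerate("ACNGR")
print(len(members), members)
```

The size of the collection is the product of the number of alternatives at each position, so a few ambiguous positions quickly produce a large primer set.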

[0112] It is anticipated that there will be a variety of strains or isolates of HCV with sequences which deviate from
HCV1, the prototype strain. Therefore, in order to detect variant strains it is preferable to construct primers which
hybridize to conserved regions of the HCV genome. The conserved regions may be determined by comparing the
nucleotide or amino acid sequences of several HCV strains/isolates. There appear to be at least three regions of
conserved amino acids in the HCV genome, described supra, from which primers may be derived. These regions are
believed to be conserved. The primers described infra, in the Examples, are derived from what are believed to be conserved
regions of HCV, based upon sequence homology to that of the Flaviviruses.

[0113] The oligonucleotide primers may be prepared by any suitable method. Methods for preparing oligonucleotides 
of specific sequence are known in the art, and include, for example, cloning and restriction of appropriate sequences, 
and direct chemical synthesis. Chemical synthesis methods may include, for example, the phosphotriester method 
described by Narang et al. (1979), the phosphodiester method disclosed by Brown et al. (1979), the
diethylphosphoramidate method disclosed in Beaucage et al. (1981), and the solid support method in U.S. Patent No. 4,458,066.

[0114] The primers may be labeled, if desired, by incorporating means detectable by spectroscopic, photochemical,
biochemical, immunochemical, or chemical means.

[0115] Template-dependent extension of the oligonucleotide primer(s) is catalyzed by a polymerizing agent in the 
presence of adequate amounts of the four deoxyribonucleotide triphosphates (dATP, dGTP, dCTP and dTTP) or
analogs, in a reaction medium which is comprised of the appropriate salts, metal cations, and pH buffering system. Suitable
polymerizing agents are enzymes known to catalyze primer- and template-dependent DNA synthesis. Known DNA
polymerases include, for example, E. coli DNA polymerase I or its Klenow fragment, T4 DNA polymerase, and Taq
DNA polymerase. The reaction conditions for catalyzing DNA synthesis with these DNA polymerases are known in the 
art. 

[0116] The products of the synthesis are duplex molecules consisting of the template strands and the primer
extension strands, which include the target sequence. These products, in turn, serve as template for another round of
replication. In the second round of replication, the primer extension strand of the first cycle is annealed with its
complementary primer; synthesis yields a "short" product which is bounded on both the 5'- and the 3'-ends by primer sequences
or their complements. Repeated cycles of denaturation, primer annealing, and extension result in the exponential
accumulation of the target region defined by the primers. Sufficient cycles are run to achieve the desired amount of
polynucleotide containing the target region of nucleic acid. The desired amount may vary, and is determined by the 
function which the product polynucleotide is to serve. 

[0117] The PCR method can be performed in a number of temporal sequences. For example, it can be performed 
step-wise, where after each step new reagents are added, or in a fashion where all of the reagents are added
simultaneously, or in a partial step-wise fashion, where fresh reagents are added after a given number of steps.

[0118] In a preferred method, the PCR reaction is carried out as an automated process which utilizes a thermostable
enzyme. In this process the reaction mixture is cycled through a denaturing region, a primer annealing region, and a
reaction region. A machine may be employed which is specifically adapted for use with a thermostable enzyme, which
utilizes temperature cycling without a liquid handling system, since the enzyme need not be added at every cycle. This
type of machine is commercially available from Perkin Elmer Cetus Corp.

[0119] After amplification by PCR, the target polynucleotides are detected by hybridization with a probe
polynucleotide which forms a stable hybrid with that of the target sequence under stringent to moderately stringent hybridization
and wash conditions. If it is expected that the probes will be completely complementary (i.e., about 99% or greater) to
the target sequence, stringent conditions will be used. If some mismatching is expected, for example if variant strains
are expected with the result that the probe will not be completely complementary, the stringency of hybridization may
be lessened. However, conditions are chosen which rule out nonspecific/adventitious binding. Conditions which affect
hybridization, and which select against nonspecific binding are known in the art, and are described in, for example,
Maniatis et al. (1982). Generally, lower salt concentration and higher temperature increase the stringency of binding.
For example, it is usually considered that stringent conditions are incubation in solutions which contain approximately
0.1 X SSC, 0.1% SDS, at about 65°C incubation/wash temperature, and moderately stringent conditions are incubation
in solutions which contain approximately 1-2 X SSC, 0.1% SDS and about 50°-65°C incubation/wash temperature.
Low stringency conditions are 2 X SSC and about 30°-50°C.
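
For convenience, the conditions quoted above can be gathered into a small lookup, sketched below. The numeric values are those stated in the preceding paragraph; the data layout, the function name, and the decision to return "moderate" for every expected identity below about 99% are assumptions made for illustration and would need to be tuned empirically.

```python
# SSC is a (low, high) multiple of standard SSC; temperatures are in degrees C.
# No SDS concentration is stated for low stringency, so it is left as None.
STRINGENCY = {
    "stringent": {"ssc": (0.1, 0.1), "sds_percent": 0.1, "temp_c": (65, 65)},
    "moderate": {"ssc": (1.0, 2.0), "sds_percent": 0.1, "temp_c": (50, 65)},
    "low": {"ssc": (2.0, 2.0), "sds_percent": None, "temp_c": (30, 50)},
}


def choose_stringency(expected_identity):
    """Suggest a starting condition from the expected probe/target identity,
    given as a fraction between 0 and 1. The 0.99 cut-off follows the text;
    everything below it maps to 'moderate' as a simplifying assumption."""
    if not 0.0 <= expected_identity <= 1.0:
        raise ValueError("expected_identity must be between 0 and 1")
    return "stringent" if expected_identity >= 0.99 else "moderate"


name = choose_stringency(0.995)
print(name, STRINGENCY[name])
```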

[0120] Probes for HCV target sequences may be derived from the HCV cDNA sequence shown in Fig. 18, or from 
new HCV isolates. The HCV probes may be of any suitable length which span the target region, but which exclude the 
primers, and which allow specific hybridization to the target region. If there is to be complete complementarity, i.e., if
the strain contains a sequence identical to that of the probe, since the duplex will be relatively stable under even
stringent conditions, the probes may be short, i.e., in the range of about 10-30 base pairs. If some degree of mismatch
is expected with the probe, i.e., if it is suspected that the probe will hybridize to a variant region, the probe may be of
greater length, since length seems to counterbalance some of the effect of the mismatch(es). An example of this is
found in the Examples, where the probe was designed to bind to potential variants of HCV1. In this case, the primers
were designed to bind to HCV cDNA derived from a hypothetical conserved region of the HCV genome, and the target
region was one which potentially contained variations (based upon the Flavivirus model). The probe used to detect
the HCV target sequences contained approximately 268 base pairs.

[0121] The probe nucleic acid having a sequence complementary to the target sequence may be synthesized using
techniques similar to those described supra for the synthesis of primer sequences. If desired, the probe may be labeled.
Appropriate labels are described supra.

[0122] In some cases, it may be desirable to determine the length of the PCR product detected by the probe. This
may be particularly true if it is suspected that variant HCV strains may contain deletions within the target region, or if
one wishes to confirm the length of the PCR product. In such cases it is preferable to subject the products to size
analysis as well as hybridization with the probe. Methods for determining the size of nucleic acids are known in the
art, and include, for example, gel electrophoresis, sedimentation in gradients, and gel exclusion chromatography.

[0123] The presence of the target sequence in a biological sample is detected by determining whether a hybrid has
been formed between the HCV polynucleotide probe and the nucleic acid subjected to the PCR amplification technique.
Methods to detect hybrids formed between a probe and a nucleic acid sequence are known in the art. For example,
for convenience, an unlabeled sample may be transferred to a solid matrix to which it binds, and the bound sample
subjected to conditions which allow specific hybridization with a labeled probe; the solid matrix is then examined for
the presence of the labeled probe. Alternatively, if the sample is labeled, the unlabeled probe is bound to the matrix,
and after the exposure to the appropriate hybridization conditions, the matrix is examined for the presence of label.
Other suitable hybridization assays are described supra.

Determination of Variant HCV Sequences Using PCR 

[0124] In order to identify variant HCV strains, and thereby to design probes for those variants, the above described
HCV/cPCR method is utilized to amplify variant regions of the HCV genome, so that the nucleotide sequences of these
variant target regions can be determined. Generally, variant types of HCV might be expected to occur in different
geographic locations than that in which the HCV1 strain is predominant, for example, Japan, Africa, etc.; or in different
vertebrate species which are also infected with the virus. Variant HCV may also arise during passage in tissue culture
systems, or be the result of spontaneous or induced mutations.

[0125] In order to amplify the variant target region, primers are designed to flank the suspect region, and preferably 
are complementary to conserved regions. Primers to two regions of HCV which are probably conserved, based upon 
the Flavivirus model, are described in the Examples. These primers and probes may be designed utilizing the sequence 
information for the HCV1 strain provided in Fig. 18. 

[0126] Analysis of the nucleotide sequence of the target region(s) may be by direct analysis of the PCR amplified
products. A process for direct sequence analysis of PCR amplified products is described in Saiki et al. (1988).

[0127] Alternatively, the amplified target sequence(s) may be cloned prior to sequence analysis. A method for the
direct cloning and sequence analysis of enzymatically amplified genomic segments has been described by Scharf
(1986). In the method, the primers used in the PCR technique are modified near their 5'-ends to produce convenient
restriction sites for cloning directly into, for example, an M13 sequencing vector. After amplification, the PCR products
are cleaved with the appropriate restriction enzymes. The restriction fragments are ligated into the M13 vector, and
transformed into, for example, a JM103 host, plated out, and the resulting plaques are screened by hybridization with
a labeled oligonucleotide probe. Other methods for cloning and sequence analysis are known in the art.

Universal Primers for Flaviviruses and for HCV

[0128] Studies of the nature of the genome of the HCV, utilizing probes derived from the HCV cDNA, as well as
sequence information contained within the HCV cDNA, are suggestive that HCV is a Flavi-like virus. These studies
are described in E.P.O. publication No. 318,216 owned by the herein assignee, and which is incorporated herein in its
entirety. A comparison of the HCV cDNA sequence derived from the HCV cDNA clones with known sequences of a
number of Flaviviruses shows that HCV contains sequences which are homologous to conserved sequences in the
Flaviviruses. These conserved sequences may allow the creation of primers which may be universal in their application
for amplification of target regions of Flaviviruses, and for HCV. These sequences are the 16-mer or smaller sequences
from the 3'-termini of the primers described in the Examples. Identification of the species is then accomplished utilizing
a probe specific for the species. The genomes of a number of Flaviviruses are known in the art, and include, for example,
Japanese Encephalitis Virus (Sumiyoshi et al. (1987)), Yellow Fever Virus (Rice et al. (1985)), Dengue Type 2 Virus
(Hahn et al. (1988)), Dengue Type 4 Virus (Mackow (1987)), and West Nile Virus (Castle et al. (1986)). Identification
of HCV RNA is accomplished utilizing a probe specific for HCV, the sequence of which can be determined from the HCV
cDNA sequences provided herein.

[0129] Alternatively, utilization of sets of probe(s) which are designed to account for codon degeneracy, and which
therefore contain sequences common to the Flaviviruses and to HCV, as determined by a comparison of HCV amino acid
sequences with the known sequences of the Flaviviruses, allows a general detection system for these viruses.
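
The effect of codon degeneracy on the size of such a probe set can be illustrated by counting how many nucleotide sequences encode a given peptide under the standard genetic code, and by locating the stretch of a conserved peptide that needs the fewest probes. The function names and the example peptide are hypothetical and are not HCV or Flavivirus sequences.

```python
# Number of codons per amino acid in the standard genetic code.
CODON_COUNT = {
    "M": 1, "W": 1,
    "F": 2, "Y": 2, "H": 2, "Q": 2, "N": 2, "K": 2, "D": 2, "E": 2, "C": 2,
    "I": 3,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "L": 6, "S": 6, "R": 6,
}


def degeneracy(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    total = 1
    for residue in peptide.upper():
        total *= CODON_COUNT[residue]
    return total


def least_degenerate_window(peptide, size):
    """Return (start, degeneracy) of the window needing the fewest probes."""
    best = None
    for start in range(len(peptide) - size + 1):
        value = degeneracy(peptide[start:start + size])
        if best is None or value < best[1]:
            best = (start, value)
    return best


print(degeneracy("LSR"))
print(least_degenerate_window("LSRMWKLL", 3))
```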


Construction of Desired DNA Sequences 

[0130] Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as described by
Warner (1984). If desired the synthetic strands may be labeled with 32P by treatment with polynucleotide kinase in the
presence of 32P-ATP, using standard conditions for the reaction.

[0131] DNA sequences, including those isolated from cDNA libraries, may be modified by known techniques,
including, for example, site-directed mutagenesis, as described by Zoller (1982). Briefly, the DNA to be modified is packaged
into phage as a single stranded sequence, and converted to a double stranded DNA with DNA polymerase using, as
a primer, a synthetic oligonucleotide complementary to the portion of the DNA to be modified, and having the desired
modification included in its own sequence. The resulting double stranded DNA is transformed into a phage supporting
host bacterium. Cultures of the transformed bacteria, which contain replications of each strand of the phage, are plated
in agar to obtain plaques. Theoretically, 50% of the new plaques contain phage having the mutated sequence, and the
remaining 50% have the original sequence. Replicates of the plaques are hybridized to labeled synthetic probe at
temperatures and conditions which permit hybridization with the correct strand, but not with the unmodified sequence.
The sequences which have been identified by hybridization are recovered and cloned.

Kits for Screening for HCV Derived Polynucleotides 

[0132] Oligomers which are primers for amplification of samples for HCV can be packaged into kits. Kits for
amplification of HCV sequences may include the oligomeric primers used in the amplification. The kits usually contain primers
in a premeasured or predetermined amount, as well as other suitably packaged reagents and materials, in separate
suitable containers, needed for the particular amplification protocol(s). For example, the kit may contain standards,
buffers, supports, enzymes, substrates, label probes, binding partners, and/or instructions for conducting the test.

Examples

[0133] Described below are examples of the present invention which are provided only for illustrative purposes, and 
not to limit the scope of the present invention. 

Isolation and Sequence of Overlapping HCV cDNA Clones 13i, 26j, CA59a, CA84a, CA156e and CA167b

[0134] The clones 13i, 26j, CA59a, CA84a, CA156e and CA167b were isolated from the lambda-gt11 library which
contains HCV cDNA (ATCC No. 40394), the preparation of which is described in E.P.O. Publication No. 318,216
(published 31 May 1989), and WO 89/04669 (published 1 June 1989). Screening of the library was with the probes described
infra, using the method described in Huynh (1985). The frequencies with which positive clones appeared with the
respective probes were about 1 in 50,000.

[0135] The isolation of clone 13i was accomplished using a synthetic probe derived from the sequence of clone 12f. 
The sequence of the probe was: 

5' GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG 3'.

[0136] The isolation of clone 26j was accomplished using a probe derived from the 5'-region of clone K9-1. The
sequence of the probe was:

5' TAT CAG TTA TGC CAA CGG AAG CGG CCC CGA 3'.

[0137] The isolation procedures for clone 12f and for clone k9-1 (also called K9-1) are described in E.P.O. Publication
No. 318,216, and their sequences are shown in Figs. 1 and 2, respectively. The HCV cDNA sequences of clones 13i
and 26j are shown in Figs. 4 and 5, respectively. Also shown are the amino acids encoded therein, as well as the
overlap of clone 13i with clone 12f, and the overlap of clone 26j with clone 13i. The sequences for these clones confirmed
the sequence of clone K9-1. Clone K9-1 had been isolated from a different HCV cDNA library (See E.P.O. Publication
No. 318,216).

[0138] Clone CA59a was isolated utilizing a probe based upon the sequence of the 5'-region of clone 26j. The
sequence of this probe was:

5' CTG GTT AGC AGG GCT TTT CTA TCA CCA CAA 3'.


[0139] A probe derived from the sequence of clone CA59a was used to isolate clone CA84a. The sequence of the 
probe used for this isolation was: 

5' AAG GTC CTG GTA GTG CTG CTG CTA TTT GCC 3'.

[0140] Clone CA156e was isolated using a probe derived from the sequence of clone CA84a. The sequence of the 
probe was: 


5' ACT GGA CGA CGC AAG GTT GCA ATT GCT CTA 3'.


[0141] Clone CA167b was isolated using a probe derived from the sequence of clone CA156e. The sequence of
the probe was:

5' TTC GAC GTC ACA TCG ATC TGC TTG TCG GGA 3'.

[0142] The nucleotide sequences of the HCV cDNAs in clones CA59a, CA84a, CA156e, and CA167b are shown in
Figs. 6, 7, 8, and 9, respectively. The amino acids encoded therein, as well as the overlap with the sequences of
relevant clones, are also shown in the figures. 

Creation of "pi" HCV cDNA Library 


[0143] A library of HCV cDNA, the "pi" library, was constructed from the same batch of infectious chimpanzee plasma 
used to construct the lambda-gt11 HCV cDNA library (ATCC No. 40394) described in E.P.O. Publication No. 318,216,
and utilizing essentially the same techniques. However, construction of the pi library utilized a primer-extension method,
in which the primer for reverse transcriptase was based on the sequence of clone CA59a. The sequence of the primer
was:

5' GGT GAC GTG GGT TTC 3'.

Isolation and Sequence of Clone pi14a

[0144] Screening of the "pi" HCV cDNA library described supra, with the probe used to isolate clone CA167b (See
supra) yielded clone pi14a. The clone contains about 800 base pairs of cDNA which overlaps clones CA167b, CA156e,
CA84a and CA59a, which were isolated from the lambda-gt11 HCV cDNA library (ATCC No. 40394). In addition, pi14a
also contains about 250 base pairs of DNA which are upstream of the HCV cDNA in clone CA167b.

Isolation and Sequence of Clones CA216a, CA290a and ag30a

[0145] Based on the sequence of clone CA167b a synthetic probe was made having the following sequence: 


5' GGC TTT ACC ACG TCA CCA ATG ATT GCC CTA 3' 

The above probe was used to screen the lambda-gt11 library, which yielded clone CA216a, whose HCV sequences are
shown in Fig. 10.

[0146] Another probe was made based on the sequence of clone CA216a having the following sequence:


5' TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG 3' 

Screening the lambda-gt11 library (ATCC No. 40394) with this probe yielded clone CA290a, the HCV sequences therein
being shown in Fig. 11. 

[0147] In a parallel approach, a primer-extension cDNA library was made using nucleic acid extracted from the same 
infectious plasma used in the original lambda-gt11 cDNA library described above. The primer used was based on the
sequence of clones CA216a and CA290a: 


5' GAA GCC GCA CGT AAG 3' 

The cDNA library was made using methods similar to those described previously for libraries used in the isolation of 
clones pi14a and k9-1. The probe used to screen this library was based on the sequence of clone CA290a:


5' CCG GCG TAG GTC GCG CAA TTT GGG TAA 3' 


Clone ag30a was isolated from the new library with the above probe, and contained about 670 base pairs of HCV
sequence. See Fig. 12. Part of this sequence overlaps the HCV sequence of clones CA216a and CA290a. About 300
base pairs of the ag30a sequence, however, are upstream of the sequence from clone CA290a. The non-overlapping
sequence shows a start codon (*) and stop codons that may indicate the start of the HCV ORF. Also indicated in Fig.
12 are putative small encoded peptides (#) which may play a role in regulating translation, as well as the putative first
amino acid of the putative polypeptide (/), and downstream amino acids encoded therein.
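
The walking strategy described above depends on each new oligonucleotide sharing sequence with a clone already in hand. As an illustrative sketch only (it is not part of the described procedure, and the function and variable names are hypothetical), the following Python compares the three oligonucleotides quoted above: the probe based on CA216a, the primer used for the primer-extension library, and the probe based on CA290a.

```python
# Illustrative sketch: relationships between oligonucleotides quoted in the text.
# Function and variable names are hypothetical; the sequences are copied from the text.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(oligo: str) -> str:
    """Remove the triplet spacing used when the sequences are printed."""
    return oligo.replace(" ", "")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def suffix_prefix_overlap(left: str, right: str) -> int:
    """Length of the longest suffix of `left` that equals a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


probe_from_ca216a = clean("TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG")
extension_primer = clean("GAA GCC GCA CGT AAG")
probe_from_ca290a = clean("CCG GCG TAG GTC GCG CAA TTT GGG TAA")

# The 3'-end of the CA290a-based probe runs into the 5'-end of the CA216a-based probe.
shared = suffix_prefix_overlap(probe_from_ca290a, probe_from_ca216a)
print("shared nucleotides between the two probes:", shared)

# The primer is written as the opposite strand; its reverse complement
# abuts the 3'-end of the CA216a-based probe.
primer_rc = reverse_complement(extension_primer)
print("primer, reverse complement:", primer_rc)
print("overlap with CA216a-based probe:", suffix_prefix_overlap(probe_from_ca216a, primer_rc))
```

Run as written, the comparison finds that the two probes share nine nucleotides (TTTGGGTAA) and that the reverse complement of the primer continues from the last eight nucleotides of the CA216a-based probe, which is consistent with a primer placed downstream of the region to be extended.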

Isolation and Sequence of Clone CA205a 

[0148] Clone CA205a was isolated from the original lambda-gt11 library (ATCC No. 40394), using a synthetic probe
derived from the HCV sequence in clone CA290a (Fig. 11). The sequence of the probe was:


5' TCA GAT CGT TGG TGG AGT TTA CTT GTT GCC 3'.

The sequence of the HCV cDNA in CA205a, shown in Fig. 13, overlaps with the cDNA sequences in both clones ag30a
and CA290a. The overlap of the sequence with that of CA290a is shown by the dotted line above the sequence (the
figure also shows the putative amino acids encoded in this fragment).

[0149] As observed from the HCV cDNA sequences in clones CA205a and ag30a, the putative HCV polyprotein
appears to begin at the ATG start codon; the HCV sequences in both clones contain an in-frame, contiguous double
stop codon (TGATAG) forty-two nucleotides upstream from this ATG. The HCV ORF appears to begin after these stop
codons, and to extend for at least 8907 nucleotides (see the composite HCV cDNA shown in Fig. 18).

Isolation and Sequence of Clone 18g

[0150] Based on the sequence of clone ag30a (See Fig. 12) and of an overlapping clone from the original lambda 
gt-11 library (ATCC No. 40394), CA230a, a synthetic probe was made having the following sequence: 


5' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3'.


Screening of the original lambda-gt11 HCV cDNA library with the probe yielded clone 18g, the HCV cDNA sequence of
which is shown in Fig. 14. Also shown in the figure are the overlap with clone ag30a, and putative polypeptides encoded
within the HCV cDNA.

[0151] The cDNA in clone 18g (C18g or 18g) overlaps that in clones ag30a and CA205a, described supra. The
sequence of C18g also contains the double stop codon region observed in clone ag30a. The polynucleotide region
upstream of these stop codons presumably represents part of the 5'-region of the HCV genome, which may contain
short ORFs, and which can be confirmed by direct sequencing of the purified HCV genome. These putative small
encoded peptides may play a regulatory role in translation. The region of the HCV genome upstream of that represented
by C18g can be isolated for sequence analysis using essentially the technique described in E.P.O. Publication No.
318,216 for isolating cDNA sequences upstream of the HCV cDNA sequence in clone 12f. Essentially, small synthetic
oligonucleotide primers for reverse transcriptase, which are based upon the sequence of C18g, are synthesized and
used to bind to the corresponding sequence in HCV genomic RNA. The primer sequences are proximal to the known
5'-terminus of C18g, but sufficiently downstream to allow the design of probe sequences upstream of the primer
sequences. Known standard methods of priming and cloning are used. The resulting cDNA libraries are screened with
sequences upstream of the priming sites (as deduced from the elucidated sequence of C18g). The HCV genomic RNA
is obtained from either plasma or liver samples from individuals with NANBH. Since HCV appears to be a Flavi-like
virus, the 5'-terminus of the genome may be modified with a "cap" structure. It is known that Flavivirus genomes contain
5'-terminal "cap" structures. (Yellow Fever virus, Rice et al. (1988); Dengue virus, Hahn et al. (1988); Japanese
Encephalitis Virus (1987)).

Isolation and Sequence of Clones from the beta-HCV cDNA library 

[0152] Clones containing cDNA representative of the 3'-terminal region of the HCV genome were isolated from a
cDNA library constructed from the original infectious chimpanzee plasma pool which was used for the creation of the
HCV cDNA lambda-gt11 library (ATCC No. 40394), described in E.P.O. Publication No. 318,216. In order to create the
cDNA library, RNA extracted from the plasma was "tailed" with poly rA using poly(rA) polymerase, and cDNA was
synthesized using oligo(dT)12-18 as a primer for reverse transcriptase. The resulting RNA:DNA hybrid was digested
with RNase H, and converted to double-stranded HCV cDNA. The resulting HCV cDNA was cloned into lambda-gt10,
using essentially the technique described in Huynh (1985), yielding the beta (or b) HCV cDNA library. The procedures
used were as follows.

[0153] An aliquot (12 ml) of the plasma was treated with proteinase K, and extracted with an equal volume of phenol
saturated with 0.05 M Tris-Cl, pH 7.5, 0.05% (v/v) beta-mercaptoethanol, 0.1% (w/v) hydroxyquinoline, 1 mM EDTA.
The resulting aqueous phase was re-extracted with the phenol mixture, followed by 3 extractions with a 1:1 mixture
containing phenol and chloroform:isoamyl alcohol (24:1), followed by 2 extractions with a mixture of chloroform and
isoamyl alcohol (1:1). Subsequent to adjustment of the aqueous phase to 200 mM with respect to NaCl, nucleic acids
in the aqueous phase were precipitated overnight at -20°C, with 2.5 volumes of cold absolute ethanol. The precipitates
were collected by centrifugation at 10,000 RPM for 40 min., washed with 70% ethanol containing 20 mM NaCl, and
with 100% cold ethanol, dried for 5 min. in a desiccator, and dissolved in water.

[0154] The isolated nucleic acids from the infectious chimpanzee plasma pool were tailed with poly rA utilizing poly-A
polymerase in the presence of human placenta ribonuclease inhibitor (HPRI) (purchased from Amersham Corp.),
utilizing MS2 RNA as carrier. Isolated nucleic acids equivalent to that in 2 ml of plasma were incubated in a solution
containing TMN (50 mM Tris-HCl, pH 7.9, 10 mM MgCl2, 250 mM NaCl, 2.5 mM MnCl2, 2 mM dithiothreitol (DTT)), 40
micromolar alpha-[32P] ATP, 20 units HPRI (Amersham Corp.), and about 9 to 10 units of RNase-free poly-A polymerase
(BRL). Incubation was for 10 min. at 37°C, and the reactions were stopped with EDTA (final concentration about 250
mM). The solution was extracted with an equal volume of phenol-chloroform, and with an equal volume of chloroform,
and nucleic acids were precipitated overnight at -20°C with 2.5 volumes of ethanol in the presence of 200 mM NaCl.

Isolation of Clone b5a

[0155] The beta HCV cDNA library was screened by hybridization using a synthetic probe, which had a sequence
based upon the HCV cDNA sequence in clone 15e. The isolation of clone 15e is described in E.P.O. Publication No.
318,216, and its sequence is shown in Fig. 3. The sequence of the synthetic probe was:


5' ATT GCG AGA TCT ACG GGG CCT GCT ACT CCA 3'.


Screening of the library yielded clone beta-5a (b5a), which contains an HCV cDNA region of approximately 1000 base
pairs. The 5'-region of this cDNA overlaps clones 35f, 19g, 26g, and 15e (these clones are described supra). The
region between the 3'-terminal poly-A sequence and the 3'-sequence which overlaps clone 15e contains approximately
200 base pairs. This clone allows the identification of a region of the 3'-terminal sequence of the HCV genome.

[0156] The sequence of b5a is contained within the sequence of the HCV cDNA in clone 16jh (described infra).
Moreover, the sequence is also present in CC34a, isolated from the original lambda-gt11 library (ATCC No. 40394).
(The original lambda-gt11 library is referred to herein as the "C" library).

Isolation and Sequence of Clones Generated by PCR Amplification of the 3'-Region of the HCV Genome

[0157] Multiple cDNA clones have been generated which contain nucleotide sequences derived from the 3'-region
of the HCV genome. This was accomplished by amplifying a targeted region of the genome by a polymerase chain
reaction technique described in Saiki et al. (1986), and in Saiki et al. (1988), which was modified as described below.
The HCV RNA which was amplified was obtained from the original infectious chimpanzee plasma pool which was used
for the creation of the HCV cDNA lambda-gt11 library (ATCC No. 40394) described in E.P.O. Publication No. 318,216.
Isolation of the HCV RNA was as described supra. The isolated RNA was tailed at the 3'-end with ATP by E. coli poly-A
polymerase as described in Sippel (1973), except that the nucleic acids isolated from chimp serum were substituted
for the nucleic acid substrate. The tailed RNA was then reverse transcribed into cDNA by reverse transcriptase, using
an oligo dT-primer adapter, essentially as described by Han (1987), except that the components and sequence of the
primer-adapter were:

Stuffer    NotI        SP6 Promoter                  Primer

AATTC      GCGGCCGC    CATACGATTTAGGTGACACTATAGAA    T15

The resultant cDNA was subjected to amplification by PCR using two primers:

Primer          Sequence

JH32 (30mer)    ATAGCGGCCGCCCTCGATTGCGAGATCTAC
JH11 (20mer)    AATTCGGGCGGCCGCCATACGA

The JH32 primer contained a 20-nucleotide sequence hybridizable to the 5'-end of the target region in the cDNA, with
an estimated Tm of 66°C. The JH11 primer was derived from a portion of the oligo dT-primer adapter; thus, it is specific to the
3'-end of the cDNA, with a Tm of 64°C. Both primers were designed to have a recognition site for the restriction enzyme
NotI at the 5'-end, for use in subsequent cloning of the amplified HCV cDNA.
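
The text does not state how the melting temperatures were estimated. One common rule of thumb for short oligonucleotides is the Wallace rule, Tm = 2(A+T) + 4(G+C) in °C; the sketch below applies it purely as an illustration. It assumes, for the example only, that JH11 corresponds to the first 20 nucleotides of the primer-adapter given above (stuffer, NotI site, and the start of the SP6 promoter); the function and variable names are hypothetical.

```python
# Rough melting-temperature estimate with the Wallace rule of thumb:
#   Tm (deg C) ~= 2 * (A + T) + 4 * (G + C)
# Assumption: this is NOT stated in the text to be the method actually used.


def wallace_tm(seq: str) -> int:
    """Wallace-rule Tm estimate for a short DNA oligonucleotide."""
    seq = seq.upper().replace(" ", "")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Primer-adapter components as listed in the text (the T15 tail is omitted here).
STUFFER = "AATTC"
NOTI_SITE = "GCGGCCGC"
SP6_PROMOTER = "CATACGATTTAGGTGACACTATAGAA"
adapter_5prime = STUFFER + NOTI_SITE + SP6_PROMOTER

# Assumption for illustration: a 20mer taken from the 5'-end of the adapter.
jh11_like = adapter_5prime[:20]

print(jh11_like, wallace_tm(jh11_like))
```

For this 20mer the rule gives 64, in line with the value quoted for JH11. The rule ignores salt and primer concentration, so it serves only as a first approximation when choosing an annealing temperature.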

[0158] The PCR reaction was carried out by suspending the cDNA and the primers in 100 microliters of reaction
mixture containing the four deoxynucleoside triphosphates, buffer salts and metal ions, and a thermostable DNA
polymerase isolated from Thermus aquaticus (Taq polymerase), which are in a Perkin Elmer Cetus PCR kit (N801-0043
or N801-0055). The PCR reaction was performed for 35 cycles in a Perkin Elmer Cetus DNA thermal cycler. Each
cycle consisted of a 1.5 min denaturation step at 94°C, an annealing step at 60°C for 2 min, and a primer extension
step at 72°C for 3 min. The PCR products were subjected to Southern blot analysis using a 30 nucleotide probe, JH34,
the sequence of which was based upon that of the 3'-terminal region of clone 15e. The sequence of JH34 is:

5' CTT GAT CTA CCT CCA ATC ATT CAA AGA CTC 3'.

The PCR products detected by the HCV cDNA probe ranged in size from about 50 to about 400 base pairs.

[0159] In order to clone the amplified HCV cDNA, the PCR products were cleaved with NotI and size selected by
polyacrylamide gel electrophoresis. DNA larger than 300 base pairs was cloned into the NotI site of pUC18S. The vector
pUC18S is constructed by including a NotI polylinker cloned between the EcoRI and SalI sites of pUC18. The clones
were screened for HCV cDNA using the JH34 probe. A number of positive clones were obtained and sequenced. The
nucleotide sequence of the HCV cDNA insert in one of these clones, 16jh, and the amino acids encoded therein, are
shown in Fig. 15. A nucleotide heterogeneity, detected in the sequence of the HCV cDNA in clone 16jh as compared
to another clone of this region, is indicated in the figure.

Isolation and Sequence of Clone 6k 

[0160] Based on the sequence of clone 16jh and clone b5a (see supra), a synthetic probe was made having the
following sequence:

5' TCT TCA ACT GGG CAG TAA GAA CAA AGC TCA 3'.


Screening of the original lambda-gt11 HCV cDNA library (described in E.P.O. Publication No. 318,216) with the probe
yielded clones with a frequency of approximately 1 in 10^6; one of these was called clone 6k (also called C6k), the HCV
cDNA sequence of which is shown in Fig. 16. Also shown in the figure are the overlap with clone 16jh, and putative
polypeptides encoded within the HCV cDNA. Sequence information on the HCV cDNA in clone 6k was obtained from
only one strand. Information on the deposit of this clone is provided infra, wherein the clone is listed as Lambda gt11
C6k. Confirmation of the C6k sequence as part of an ORF encoding HCV1 polypeptide has been obtained by sequencing
other overlapping clones.

Isolation and Sequence of Clone p131jh 

[0161] A clone containing sequence from the 3'-region of the HCV genome, and which contains an in-frame stop
codon, was isolated essentially as described supra for the isolation of clones generated by PCR amplification of the
3'-region of the genome, except that HCV1 RNA was converted to cDNA using the oligonucleotide

5' AAT TCG CGG CCG CCA TAC GAT TTA GGT GAC ACT ATA GAA T15 3'.

The cDNA was then amplified by the PCR reaction using the primers: 

5' TTC GCG GCC GCT ACA GCG GGG GAG ACA T 3'

and

5' AAT TCG CGG CCG CCA TAC GA 3'.


[0162] After amplification, the PCR products were precipitated with spermine, digested with NotI, and extracted with
phenol. The purified products were cloned into the NotI site of pUC18S, and HCV positive clones were selected using
the oligonucleotide:

5' CGA TGA AGG TTG GGG TAA ACA CTC CGG CCT 3'.


The HCV cDNA in one clone, designated p131jh, is shown in Fig. 17. This clone contains an in-frame stop codon for
the large ORF contained in the HCV genome.

Isolation and Sequence of Clone 5'-clone32 

[0163] A clone containing sequence from the 5'-region of the HCV genome, upstream of the sequence in clone
b114a, was isolated and the nucleotide sequence determined by a modification of the method for the isolation and
sequence of clones generated by PCR amplification of the 3'-region of the genome, described in U.S. S.N. 456,637,
which is incorporated by reference. Generally, a target region of the genome was amplified by the PCR technique
described in Saiki et al. (1986), and in Saiki et al. (1988). The HCV RNA which was amplified was obtained by extracting
human serum (U.S. clinical isolate, HCV27) using a cold guanidinium thiocyanate method described by Han et al.
(1987). The extracted RNA was converted into single-stranded cDNA with reverse transcriptase, using a primer, JH94,
which is complementary to nucleotides -250 to -223 of the HCV genome (see Fig. 18). The sequence of JH94 is:


5' CCT GCG GCC GCA CGA CAC TCA TAC TAA 3'.


Conversion of single- to double-stranded HCV cDNA was accomplished by tailing the DNA with approximately 20 to
50 dA residues using terminal deoxynucleotidyl transferase (Sambrook et al. (1989), MOLECULAR CLONING), and
replicating the tailed molecule using the following oligo-dT primer-adapter, which contains a NotI site and an SP6
promoter:

Stuffer    NotI        SP6 Promoter                  Primer

AATTC      GCGGCCGC    CATACGATTTAGGTGACACTATAGAA    T15

The resultant cDNA was subjected to amplification by PCR using two primers, JH94 (described supra) and JH11,
which has the following sequence.

Primer          Sequence

JH11 (20mer)    AATTCGGGCGGCCGCCATACGA

[0164] The PCR reaction was carried out by suspending the cDNA and the primers in 100 microliters of reaction
mixture containing the four deoxynucleoside triphosphates, buffer salts and metal ions, and a thermostable DNA
polymerase isolated from Thermus aquaticus (Taq polymerase), which are in a Perkin Elmer Cetus PCR kit (N801-0043
or N801-0055). The PCR reaction was performed for 35 cycles in a Perkin Elmer Cetus DNA thermal cycler. Each
cycle consisted of a 1.5 min denaturation step at 94°C, an annealing step at 60°C for 2 min, and a primer extension
step at 72°C for 3 min.

[0165] The PCR products were digested with NotI, and cloned into pUC18S. Clones containing HCV nucleotide
sequences were obtained by screening with a probe, Alex90, which is derived from nucleotides -312 to -283 of the
HCV1 genome, and which has the sequence:


5' ACC ATG AAT CAC TCC CCT GTG AGG AAC TAC 3'.


The HCV cDNAs in the isolated clones were sequenced by the dideoxy chain termination method (Sanger et al. (1977)).
The sequence of HCV cDNA in one of the isolated clones, 5'-clone32, spans the region of nucleotides -224 to -341 in
Fig. 18.

[0166] An analysis of the nucleotide sequence of the HCV cDNA showed that the replicate of the HCV RNA strand
contains a GC-rich stretch which may be capable of forming a stable hairpin structure:


G-T A 


A C-C-C-C-C-G C-C-G 5' 


T-G-G-G-G-G-C-G-A-C 3 ' . 

In the structure, the dashed lines indicate possible hydrogen bonds between complementary nucleotides. 

[0167] A search in the computer database, GenBank, revealed that homologous sequences were absent from
known viral sequences. Thus, this sequence may be unique to the 5'-terminus of the HCV genome.

[0168] A hairpin structure may serve as a recognition signal for a transcriptase and/or it may contribute to the stability
of the RNA at the 5'-terminus.

Compiled HCV cDNA Sequences 

[0169] An HCV cDNA sequence has been compiled from a series of overlapping clones derived from various HCV
cDNA libraries described herein, and in E.P.O. Publication No. 318,216. The clones from which Fig. 18 has been
derived are clone 5'-32, b114a, 18g, ag30a, CA205a, CA290a, CA216a, pi14a, CA167b, CA156e, CA84a, CA59a,
K9-1 (also called k9-1), 26j, 13i, 12f, 14i, 11b, 7f, 7e, 8h, 33c, 40b, 37b, 35, 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, 39c,
35f, 19g, 26g, 15e, b5a, 16jh, C6k and p131jh. The methods for isolation of these clones, as well as their sequences,
are discussed herein, and in E.P.O. Publication No. 318,216, which is incorporated herein by reference. In Fig. 18, the
three dashes above the sequence indicate the position of the putative initiator methionine codon.

[0170] Clone b114a overlaps with clones 18g, ag30a, and CA205a, except that clone b114a contains an extra two
nucleotides upstream of the sequence in clone 18g (i.e., 5'-CA). These extra two nucleotides have been included in
the HCV genomic sequence shown in Fig. 18.

[0171] It should be noted that although several of the clones described supra have been obtained from libraries
other than the original HCV cDNA lambda-gt11 C library (ATCC No. 40394), these clones contain HCV cDNA sequences
which overlap HCV cDNA sequences in the original library. Thus, essentially all of the HCV sequence is derivable from
the original lambda-gt11 C library (ATCC No. 40394) which was used to isolate the first HCV cDNA clone (5-1-1). The
isolation of clone 5-1-1 is described in E.P.O. Publication No. 318,216, which is incorporated herein by reference.

[0172] The putative sequence of the major HCV polyprotein encoded in the composite of HCV1 cDNA is also shown.
The first amino acid in the sequence is the putative initiator methionine of the large ORF. The variant amino acids, due
to the clonal heterogeneities, are indicated above the sequence. Since the lambda-gt11 library was created from serum
obtained from one individual (see E.P.O. Publication No. 318,216), the results suggest that variant viral sequences
(both nucleotide and amino acid) are present in that individual.

[0173] An examination of the composite HCV cDNA sequence shows that besides the large ORF, there are a number 
of ORFs upstream of that encoding the polyprotein, and within the sequence encoding the polyprotein there are a large 
number of smaller ORFs in the other two translational frames. The ORFs upstream of the HCV polyprotein are shown 
in the Table immediately below.

Table

ORFs Upstream of that Encoding the Large HCV Polyprotein

Nucl. #    Translation Frame    Amino Acid Sequence

-310       1                    MNHSPVRNYCLHAESV
-329       3                    MGATLHHESLPCEELLSSRRKRLAMALV
-246       2                    MSWQPPGPPLPGEP
-127       1                    MPGDLGVPPQDC


The reading frame, position, and size of the ORFs downstream of the sequence encoding the putative initiator MET 
of the polyprotein are shown in the Table below. The major polyprotein is that translated from reading frame 2. 

Table

ORFs Downstream of the Putative Initiator MET Encoding Sequence

Reading Frame    Size (aa)    Position (bp)

1                168          696
1                105          2343
1                119          5616
2                3025         -42
3                160          5
3                111          1667
3                148          6893


[0174] In addition to the above, an examination of the sequence which is complementary to the genomic strand of
HCV RNA shows that it also contains several small ORFs. One of these ORFs, which is complementary to nucleotides -341 to +837
in the HCV RNA sequence, encodes a polypeptide of 385 amino acids.
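
ORF tabulations of the kind given above can be produced by scanning each translational frame of a sequence, and of its complementary strand, for a start codon followed in-frame by a stop codon. The following Python is a minimal sketch of such a scan on a toy sequence; it is not the analysis actually used for Fig. 18, and the function names, the `min_codons` parameter, and the toy input are hypothetical.

```python
# Minimal ORF scan over the three frames of a DNA sequence (toy example).
# Names and the toy sequence are illustrative assumptions only.

STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=2):
    """Return a list of (frame, start_index, codon_count) tuples.

    frame is 1, 2 or 3; start_index is the 0-based index of the ATG;
    codon_count counts codons from the ATG up to, but not including, the stop.
    Only the first ATG after a stop codon (or after the frame start) opens an ORF.
    """
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                codons = (i - start) // 3
                if codons >= min_codons:
                    orfs.append((frame + 1, start, codons))
                start = None
    return orfs


# Toy sequence: a double stop (TGATAG), then ATG AAA CCC GGG and a TAA stop.
toy = "TGATAGATGAAACCCGGGTAAC"

print(find_orfs(toy))                      # genomic-sense strand
print(find_orfs(reverse_complement(toy)))  # complementary strand
```

In this toy case the only ORF is the four-codon one in frame 1 that begins immediately after the double stop codon, and the complementary strand contains none. A scan of a real composite sequence would additionally need to handle ambiguous bases and ORFs that run off the end of the known sequence.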


Comparison of the Sequences of 5'-Regions Obtained from HCV Isolates from Different Geographical Locations

[0175] Nucleotide sequences from the 5'-regions of HCV isolates from the U.S.A. (HCV18, HCV27), from Italy
(HCVI1, HCVI24), and from Korea (HCVK1) were compared.

[0176] Isolation of the HCV cDNA sequences was essentially as described supra for the isolation of 5'-clone32,
except for the following. The extracted RNA was reverse-transcribed into cDNA using as primers either JH51 or r16,
which are complementary to HCV nucleotides -90 to -73 and 366 to 383, respectively. The sequences of these primers
are as follows.

Primer    Sequence

JH51      5' CCC AAC ACT ACT CGG CTA 3'
r16       5' CAC GTA AGG GTA TCG ATG 3'

Amplification of the HCV dsDNA was by the PCR method using JH93 and JH52 as 5'- and 3'-primers, respectively.
The HCV sequence in JH93 is derived from HCV nucleotides -317 to -296, that in JH52 is from HCV nucleotides -93
to -117; the nucleotide numbers are indicated in parentheses below the sequences. In JH52 the underlined dinucleotide
has been mutated to create the NotI site. The sequences of these primers are the following.

(Primer)    Stuffer      NotI        HCV sequence

(JH93)      5' TTC       GCGGCCGC    ACTCCATGAATCACTCCCC 3'
                                     (-317)       (-296)

(JH52)      5' AGTCTT    GCGGCCGC    ACGCCCAAATC 3'
                                     (-93)  (-117)

After amplification, the PCR products were cleaved by NotI, and cloned into pUC18S. The HCV cDNAs were sequenced
either by direct sequencing after amplification by PCR, or alternatively, the cloned HCV cDNAs were sequenced by
the primer extension and the dideoxy method. Primer extension and the dideoxy method of sequencing were performed
as described supra for the sequence of 5'-clone32.

[0177] The PCR method for direct sequencing used Alex90 (see supra for the sequence) as the 5'-primer, and r25
as the 3'-primer. Alex90 is derived from HCV nucleotides -312 to -283, and r25 is derived from nucleotides 365 to 342
(see Fig. 18). The sequence of r25 is:


5' ACC TTA CCC AAA TTG CGC GAC CTA 3'.


[0178] A comparison of the sequences of the 5'-region of HCV27, HCVK1, HCVI1, HCVI24, and HCV18 with the
sequence of the prototype HCV, HCV1, showed the following. The examined 5'-region is highly conserved amongst
the 5 HCV isolates. The sequences appeared to be identical except for one nucleotide which was deleted at position
-171 in HCVI24, and for the ambiguity in four nucleotides at positions -222 to -219 in isolate HCVK1.

[0179] The high levels of sequence conservation in this region may reflect the role of this region in viral replication,
and/or transcription, and/or translation.
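
A comparison of this kind amounts to walking along pre-aligned sequences and recording each position at which an isolate departs from the prototype, distinguishing deletions from ambiguous base calls. The sketch below shows one way this could be done in Python. It uses short toy sequences rather than the HCV sequences themselves, assumes the alignment has already been made, and uses hypothetical names and conventions (`-` for a deleted base, `N` for an ambiguous call, coordinates that simply increase by one per column).

```python
# Toy comparison of an isolate against a prototype sequence, column by column.
# Assumes the two sequences are already aligned and of equal length.


def list_differences(reference, other, first_position, gap="-", ambiguous="N"):
    """Return (position, reference_base, other_base, kind) for each differing column.

    `first_position` is the coordinate assigned to the first aligned column;
    later columns are numbered consecutively from it.
    """
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to equal length")
    differences = []
    for offset, (ref_base, other_base) in enumerate(zip(reference, other)):
        if ref_base == other_base:
            continue
        if other_base == gap:
            kind = "deletion"
        elif other_base == ambiguous:
            kind = "ambiguous"
        else:
            kind = "substitution"
        differences.append((first_position + offset, ref_base, other_base, kind))
    return differences


prototype = "ACGTACGT"   # toy reference, first column numbered -10
isolate = "ACG-ACNT"     # toy isolate: one deleted base, one ambiguous call

for row in list_differences(prototype, isolate, first_position=-10):
    print(row)
```

For the toy input, the function reports a deletion at position -7 and an ambiguous call at position -4, and nothing else, which is the same form of summary as that given above for HCVI24 and HCVK1.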

Sequence Variations in HCV Isolates from Different Individuals 

[0180] Isolates of HCV which contain sequences which deviate from CDC/HCV1 were identified in human individuals,
some of whom were serologically positive for anti-C100-3 antibodies (EC10 was antibody negative). Identification of
these new isolates was accomplished by cloning and sequencing segments of the HCV genome which had been
amplified by the PCR technique using CDC/HCV1 sequences. Amplification was accomplished essentially based on an
HCV/cPCR method. The method utilizes primers and probes based upon the HCV cDNA sequences described herein.
The first step in the method is the synthesis of a cDNA to either the HCV genome, or its replicative intermediate, using
reverse transcriptase. After synthesis of the HCV cDNA, and prior to amplification, the RNA in the sample is degraded
by techniques known in the art. A designated segment of the HCV cDNA is then amplified by the use of the appropriate
primers. The amplified sequences are cloned, and clones containing the amplified sequences are detected by a probe
which is complementary to a sequence lying between the primers, but which does not overlap the primers.

HCV Isolates Isolated from Humans in the U.S. 

[0181] Blood samples which were used as a source of HCV virions were obtained from the American Red Cross in
Charlotte, North Carolina, and from the Community Blood Center of Kansas, Kansas City, Missouri. The samples were
screened for antibodies to the HCV C100-3 antigen using an ELISA assay as described in E.P.O. Publication No.
318,216, and subjected to supplemental Western blot analysis using a polyclonal goat anti-human HRP to measure
anti-HCV antibodies. Two samples, #23 and #27, from the American Red Cross and the Community Blood Center
of Kansas, respectively, were determined to be HCV positive by these assays.

[0182] Viral particles present in the serum of these samples were isolated by ultracentrifugation under the conditions
described by Bradley et al. (1985). RNA was extracted from the particles by digestion with proteinase K and SDS at
final concentrations of 10 micrograms/ml proteinase K, and 0.1% SDS; digestion was for 1 hour at 37°C. Viral RNA
was further purified by extraction with chloroform-phenol, as described in E.P.O. Publication No. 318,216.

[0183] HCV RNA in the preparation of RNA was reverse transcribed into cDNA essentially as described in E.P.O.
Publication No. 318,216, except that the oligonucleotide JHC 7, which corresponds to the cDNA sequence 1958-1939,
and which has the following sequence, was used as primer for the reverse transcriptase reaction.

JHC 7: CCA GCG GTG GCC TGG TAT TG.


[0184] After both strands of the cDNA were synthesized, the resulting cDNA was then amplified by the PCR method
essentially as described supra for the isolation of clones generated by PCR amplification, except that the
oligonucleotide primers used, i.e., JHC 6 and ALX 80, were designed to amplify a 1080 nucleotide segment of the HCV genome
from CDC/HCV1 nucleotides 673 to 1751. The primers, in addition, are designed to incorporate a NotI restriction site
at the 3'-end of the PCR product, and a blunt end at the 5'-terminus. The sequences of the primers are:

ALX 80: TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG;

and

JHC 6: ATA TGC GGC CGC CTT CCG TTG GCA TAA.

ALX 80 corresponds to nucleotides 673-702 of the CDC/HCV1 sequence; JHC 6 corresponds to nucleotides 1752-1738
of the HCV1 (in addition there are 12 extra nucleotides which encode a NotI site). The designation of nucleotides in
JHC 6, i.e., a declining number, indicates the placement in the anti-sense strand.
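
The primer coordinates given above can be checked with simple inclusive arithmetic. The Python below is an illustrative sketch with hypothetical helper names; it takes the sequences and coordinates exactly as quoted and checks that the oligonucleotide lengths match their stated spans, that the 12 extra nucleotides of JHC 6 carry a NotI recognition sequence (GCGGCCGC), and what segment length the outer primer coordinates imply.

```python
# Consistency check of the primer coordinates quoted in the text.
# Helper names are hypothetical; sequences and coordinates are copied from the text.

NOTI_SITE = "GCGGCCGC"

alx80 = "TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG".replace(" ", "")
jhc6 = "ATA TGC GGC CGC CTT CCG TTG GCA TAA".replace(" ", "")


def span_length(first, last):
    """Inclusive length of a coordinate range given in either direction."""
    return abs(last - first) + 1


alx80_span = (673, 702)    # sense strand
jhc6_span = (1752, 1738)   # declining numbers: anti-sense strand

jhc6_hcv_length = span_length(*jhc6_span)        # HCV-derived part of JHC 6
jhc6_extra_length = len(jhc6) - jhc6_hcv_length  # non-HCV nucleotides at the 5'-end
segment_length = span_length(alx80_span[0], jhc6_span[0])

print("ALX 80 length:", len(alx80), "span:", span_length(*alx80_span))
print("JHC 6 HCV part:", jhc6_hcv_length, "extra:", jhc6_extra_length)
print("NotI site in extra nucleotides:", NOTI_SITE in jhc6[:jhc6_extra_length])
print("segment spanned by the primers:", segment_length)
```

Counting inclusively from nucleotide 673 to the outer end of JHC 6 at nucleotide 1752 gives 1080 nucleotides, and JHC 6 divides into 12 extra nucleotides containing the NotI sequence followed by 15 HCV-derived nucleotides.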

[0185] After PCR amplification with the above described primers, the blunt end terminus was converted into a NotI
site as follows. A homopolymer tail of 15 dGs was attached to the PCR product using terminal deoxynucleotidyl
transferase, and the products were again subjected to amplification by PCR using as primers JHC 6 and JHC 13. The
latter primer, JHC 13, the sequence of which follows, is designed to contain a NotI site in addition to an SP6 phage
promoter. (The SP6 promoter is described in GENETIC ENGINEERING, J. Setlow Ed. (1988).)

JHC 13: AAT TCG CGG CCG CCA TAC GAT TTA GGT GAC ACT ATA GAA CCC CCC CCC CCC CCC.

[0186] In order to clone the amplified HCV cDNA, the PCR products were cleaved with NotI, precipitated with
spermine to remove free oligonucleotides (Hoopes et al. (1981)), and cloned into the NotI site of pUC18S (see Section IV.
A. 34.). The HCV cDNAs in three clones derived from each HCV isolate were subjected to sequence analysis. Analysis
was essentially by the method described in Chen and Seeburg (1985).

[0187] Consensus sequences of the clones derived from HCV in samples 23 and 27 are shown in Fig. 46 and Fig. 
47, respectively. The variable sequences are also shown in these figures, as are the amino acids encoded in the 
consensus sequences. 

[0188] Fig. 39 and Fig. 40 show comparisons of the aligned positive strand nucleotide sequences (Fig. 39) and
putative amino acid sequences (Fig. 40) of samples 23, 27, and HCV1. The amino acid sequence of HCV1 in Fig. 39
represents amino acid numbers 129-467 of the HCV polyprotein encoded by the large ORF in the HCV genomic RNA.
An examination of Fig. 46 and Fig. 47 shows that there are variations in the sequences of the three isolated clones. The
sequence variations at the nucleotide level and the amino acid level are summarized in the table immediately below.
In the table, the polypeptides designated S and NS1 represent amino acid numbers 130 to ~380, and 380 to ~470,
respectively. The numbering is from the putative initiator methionine. The terminology S and NS1 is based upon the
positioning of the sequences encoding the polypeptides using the Flavivirus model. As discussed above, however,
recent evidence suggests that there is not total correlation between HCV and the Flaviviruses with regard to viral
polypeptide domains, particularly in the putative E/NS1 domains. Indeed, HCV polypeptides and their coding domains
may exhibit substantial deviation from the Flavivirus model.

Table

Sequence Homology

                 Nucleotide Encoding         Amino Acid Encoded
                 overall    S      NS1       overall    S      NS1
                 %          %      %         %          %      %

HCV1/HCV23       93         95     91        92         95     87
HCV1/HCV27       89         93     84        89         95     82
HCV23/HCV27      89         93     85        90         93     84

[0189] Although there are variations in the newly isolated HCV sequences, the cloned sequences from samples 23 
and 27 (called HCV23 and HCV27) each contain 1019 nucleotides, indicating a lack of deletion and addition mutants 
in this region in the selected clones. The sequences in Figs. 39 and 40 also show that the isolated sequences are not 
rearranged in this region. 

[0190] A comparison of the consensus sequences for HCV1 and for the other isolates of HCV is summarized in the
Table, supra. The sequence variations between the chimpanzee isolate HCV1 and the HCVs isolated from humans
are about the same as that seen between the HCVs of human origin.

[0191] It is of interest that the sequence variation in two of the putative domains is not uniform. The sequence in a
putative S region appears to be relatively constant, with the variations randomly scattered throughout the region. In contrast, a putative
NS1 region has a higher degree of variability than the overall sequence, and the variation appears to be in a
hypervariable pocket of about 28 amino acids which is located about 70 amino acids downstream from the putative
N-terminus of the putative polyprotein.

[0192] Although it may be argued that the detected variations were introduced during the amplification process, it is
unlikely that all of the variations result from it. It has been estimated that Taq polymerase introduces errors into
a sequence at approximately one base per 10 kilobases of DNA template per cycle (Saiki et al. (1988)). Based upon
this estimate, up to 7 errors may have been introduced during the PCR amplification of the 1019 bp DNA fragment.
However, the three subclones of HCV-23 and HCV-27 yielded 29 and 14 base variations, respectively. The following
observations suggest that these variations are naturally occurring. About 60% of the base changes are silent mutations
which do not change the amino acid sequence. Variations introduced by the Taq polymerase during PCR amplification
would be expected to occur randomly; however, the results show that the variant sequences are clustered in at least
one specific region. Moreover, a consensus sequence was derived by sequencing multiple different clones derived from
the PCR amplified products.
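
The error estimate in the preceding paragraph can be reproduced with a short calculation. The following is a minimal sketch only: the error rate is the figure quoted from Saiki et al. (1988), whereas the total cycle count of 70 is an assumption (it is not stated in this passage) chosen to illustrate how an upper bound of about 7 errors could arise for a 1019 bp fragment. The function and variable names are hypothetical.

```python
def expected_taq_errors(fragment_bp, errors_per_base_per_cycle, cycles):
    """Rough upper-bound estimate of misincorporations accumulated in one fragment."""
    return fragment_bp * errors_per_base_per_cycle * cycles


ERROR_RATE = 1 / 10_000   # one base per 10 kilobases of template per cycle (from the text)
ASSUMED_CYCLES = 70       # assumption: total cycles over all rounds, not given in this passage

estimate = expected_taq_errors(1019, ERROR_RATE, ASSUMED_CYCLES)
print(f"Estimated upper bound: {estimate:.1f} errors per 1019 bp fragment")
```

Under these assumptions the estimate is well below the 29 and 14 base variations observed in the subclones, which is the comparison the paragraph relies on.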

HCV Isolates from Humans in Italy and in the U.S. 

[0193] Segments of HCV RNA present in different isolates were amplified by the HCV/cPCR method. These segments
span a region of ~0.6 Kb to ~1.6 Kb downstream from the methionine encoding start codon of the putative HCV
polyprotein. The isolates are from biological specimens obtained from HCV infected individuals. More specifically,
isolate HCT #18 is from human plasma from an individual in the U.S.A., EC1 and EC10 are from a liver biopsy of an
Italian patient, and Th is from a peripheral blood mononucleocyte fraction of an American patient. Comparable segments
of HCV RNA have been isolated from a chimpanzee.

[0194] RNA was extracted from the human plasma specimens using phenol:CHCl3:isoamyl alcohol extraction. Either
0.1 ml or 0.01 ml of plasma was diluted to a final volume of 1.0 ml, with a TENB/proteinase K/SDS solution (0.05 M
Tris-HCl, pH 8.0, 0.001 M EDTA, 0.1 M NaCl, 1 mg/ml Proteinase K, and 0.5% SDS) containing 10 to 40 micrograms/ml
polyadenylic acid, and incubated at 37°C for 60 minutes. After this proteinase K digestion, the resultant plasma
fractions were deproteinized by extraction with TE (50 mM Tris-HCl, pH 8.0, 1 mM EDTA) saturated phenol, pH 6.5.
The phenol phase was separated by centrifugation, and was reextracted with TENB containing 0.1% SDS. The resulting
aqueous phases from each extraction were pooled, and extracted twice with an equal volume of phenol/chloroform/
isoamyl alcohol [1:1(99:1)], and then twice with an equal volume of a 99:1 mixture of chloroform/isoamyl alcohol.
Following phase separation by centrifugation, the aqueous phase was brought to a final concentration of 0.2 M Na
Acetate, and the nucleic acids were precipitated by the addition of two volumes of ethanol. The precipitated nucleic
acids were recovered by ultracentrifugation in a SW 41 rotor at 38 K, for 60 minutes at 4°C, or in a microfuge for 10
minutes at 10K, 4°C.

[0195] RNA extracted from the liver biopsy was provided by Dr. F. Bonino, Ospedale Maggiore di S. Giovanni Battista, 
Torino, Italy. 

[0196] The mononucleocyte fraction was obtained by sedimentation of the individual's aliquot of blood through
Ficoll-Paque® (Pharmacia Corp), using the manufacturer's directions. Total RNA was extracted from the fraction using the
guanidinium thiocyanate procedure described in E.P.O. Publication No. 318,216 (See also Choo et al. (1989)).

[0197] Synthesis of HCV cDNA from the samples was accomplished using reverse transcriptase, and primers derived
from clone 156e and from clone K91. These primers, which are anti-sense relative to the genomic RNA, have the
following sequences.

156e16B: 5' CGA CAA GAA AGA CAG A 3'

and

K91/16B: 5' CGT TGG CAT AAC TGA T 3'


[0198] Following ethanol precipitation, the precipitated RNA or nucleic acid fraction was dried, and resuspended in
DEPC treated distilled water. Secondary structures in the nucleic acids were disrupted by heating at 65°C for 10 minutes,
and the samples were immediately cooled on ice. cDNA was synthesized using 1 to 3 micrograms of total RNA
from liver, or from nucleic acids (or RNA) extracted from 10 to 100 microliters of plasma. The synthesis utilized reverse
transcriptase, and was in a 25 microliter reaction, using the protocol specified by the manufacturer, BRL. All reaction
mixtures for cDNA synthesis contained 23 units of the RNAase inhibitor, RNASIN™ (Fisher/Promega). Following cDNA
synthesis, the reaction mixtures were diluted with water, boiled for 10 minutes, and quickly chilled on ice.

[0199] Each set of samples was subjected to two rounds of PCR amplification. The primers for the reactions were
selected to amplify regions designated "EnvL" and "EnvR". The "EnvL" region encompasses nucleotides 669-1243,
and putative amino acids 117 to 308; the "EnvR" region encompasses nucleotides 1215-1629, and encodes putative
amino acids 300-408 (the putative amino acids are numbered starting from the putative methionine initiation codon).
The relationship of these regions relative to the putative polyprotein encoded in the HCV cDNA, and to the polypeptides
encoded in the Flavivirus model is shown in Fig. 48.

[0200] The primers for the first round of PCR reactions were derived from the HCV cDNA sequences in either clone
ag30a, clone 156e, or clone k9-1. The primers used for the amplification of the EnvL region were 156e16B (shown
supra), and ag30a16A for the sense strand; the amplification of the EnvR region utilized the primer K91/16B (shown
supra), and 156e16A for the sense strand. The sequences of the sense strand primers are the following.

For EnvL, ag30a16A: 5' CTC TAT GGC AAT GAG G 3'

and

For EnvR, 156e16A: 5' AGC TTC GAC GTC ACA T 3'


[0201] The PCR reactions were performed essentially according to the manufacturer's directions (Cetus-Perkin-Elmer),
except for the addition of 1 microgram of RNase A. The reactions were carried out in a final volume of 100
microliters. The PCR was performed for 30 cycles, utilizing a regimen of 94°C (1 min), 37°C (2 min), and 72°C (3 min),
with a 7 minute extension at 72°C for the last cycle. The samples were then extracted with phenol:CHCl3, ethanol
precipitated two times, resuspended in 10 mM Tris-HCl, pH 8.0, and concentrated using Centricon-30 (Amicon) filtration.
This procedure efficiently removes oligonucleotides less than 30 nucleotides in size; thus, the primers from the first
round of PCR amplification are removed.
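
As an illustration of the first-round cycling regimen, the following minimal sketch totals the programmed hold times. It assumes that the 7 minute extension replaces the 3 minute extension in the last cycle, and it ignores ramp times between temperatures, which depend on the instrument. The function name is hypothetical.

```python
def program_minutes(cycles, step_minutes, final_extension_minutes):
    """Total programmed hold time in minutes.

    step_minutes lists the denaturation, annealing, and extension holds of one cycle.
    The extension of the last cycle is replaced by the longer final extension.
    Ramp times between temperatures are ignored (assumption).
    """
    per_cycle = sum(step_minutes)
    normal_extension = step_minutes[-1]
    return cycles * per_cycle - normal_extension + final_extension_minutes


# First round: 30 cycles of 94°C (1 min), 37°C (2 min), 72°C (3 min), 7 min final extension.
first_round = program_minutes(30, (1, 2, 3), 7)
print(first_round)  # 184
```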

[0202] The Centricon-30 concentrated samples were then subjected to a second round of PCR amplification using
primers designed from clones 202a and 156e for the EnvL region, and from 156e and 59a for the EnvR region. The
primers for amplification of the EnvL region have the following sequences.

202aEnv41a: 5' CTT GAA TTC GCA ATT TGG GTA AGG TCA TCG ATA CCC TTA CG 3'

and

156e38B': 5' CTT GAA TTC GAT AGA GCA ATT GCA ACC TTG CGT CGT CC 3'.

The primers for amplification of the EnvR region in RNAs derived from humans have the following sequences.

156e38A': 5' CTT GAA TTC GGA CGA CGC AAG GTT GCA ATT GCT CTA TC 3'

and

59aEnv39C: 5' CTT GAA TTC CAG CCG GTG TTG AGG CTA TCA TTG CAG TTC 3'.

Amplification by PCR was for 35 cycles utilizing a regimen of 94°C (1 min), 60°C (1 min), and 72°C (2 min), with a 7
minute extension at 72°C for the last cycle. The samples were then extracted with phenol:CHCl3, precipitated two
times, and digested with EcoRI. The PCR reaction products were analyzed by separation of the products by electrophoresis
on 6% polyacrylamide gels. DNA of approximately the estimated size of the expected PCR product was
electroeluted from the gels, and subcloned into either a pGEM-4 plasmid vector or into lambda gt11. The expected
product sizes for the EnvL and EnvR after the first round of amplification are 615 bp and 683 bp, respectively; after
the second round of amplification the expected product sizes for EnvL and EnvR are 414 bp and 575 bp, respectively.
The plasmids containing the amplified products were used to transform host cells; the pGEM-4 plasmid was used to
transform DH5-alpha, and lambda gt11 was used to transform C600 delta-HFL. Clones of the transformed cells which
either hybridized to the appropriate HCV probes (described below), or those which had inserts of the correct size were
selected. The inserts were then cloned in M13 and sequenced.
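
The second-round products are digested with EcoRI before subcloning, and each second-round primer listed above begins with CTT GAA TTC. The following minimal sketch checks that every one of these primers carries the EcoRI recognition sequence GAATTC near its 5' end, and reports each primer's length. The sequences are copied from the text; the helper names are hypothetical.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence

# Second-round primers as listed in the text (5' to 3').
SECOND_ROUND_PRIMERS = {
    "202aEnv41a": "CTT GAA TTC GCA ATT TGG GTA AGG TCA TCG ATA CCC TTA CG",
    "156e38B'": "CTT GAA TTC GAT AGA GCA ATT GCA ACC TTG CGT CGT CC",
    "156e38A'": "CTT GAA TTC GGA CGA CGC AAG GTT GCA ATT GCT CTA TC",
    "59aEnv39C": "CTT GAA TTC CAG CCG GTG TTG AGG CTA TCA TTG CAG TTC",
}


def normalize(sequence):
    """Remove the triplet spacing used in the text and upper-case the sequence."""
    return sequence.replace(" ", "").upper()


def ecori_report(primers):
    """Map each primer name to (length in nucleotides, 0-based index of the EcoRI site or -1)."""
    report = {}
    for name, raw in primers.items():
        seq = normalize(raw)
        report[name] = (len(seq), seq.find(ECORI_SITE))
    return report


report = ecori_report(SECOND_ROUND_PRIMERS)
for name, (length, site_index) in report.items():
    print(f"{name}: {length} nt, EcoRI site at index {site_index}")
```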

[0203] The probes for all of the HCV/cPCR products consisted of 32P-labeled sections of HCV cDNA which had been
prepared by PCR amplification of a region of clone 216 (using CA216a16A and CA216a16B as primers), and of clone 84
(using CA84a16A and CA84a16B or CA84a16C as primers); 32P was introduced into the PCR products by nick translation.
The probes for the first and second round of EnvL amplification were from clone 216. Those for the first round
of EnvR amplification were from clone 84 (i.e., CA84a16A and CA84a16B); those for the second round of EnvR
amplification were CA84a16A and CA84a16C. These probes did not overlap the primers used in the HCV/cPCR reactions.
The sequence of the primers for the PCR amplification of the probes is in the following table.

Table

Primer       Clone   Sequence

CA216a16A    216     5' TGA ACT ATG CAA CAG G 3'
CA216a16B    216     5' GGA GTG TGC AGG ATG G 3'
CA84a16A     84      5' AAG GTT GCA ATT GCT C 3'
CA84a16B     84      5' ACT AAC AGG ACC TTC G 3'
CA84a16C     84      5' TAA CGG GTC ACC GCA T 3'


[0204] Sequence information on variants in the EnvL region was obtained from 3 clones from HCT #18, 2 clones
from Th, 3 clones from EC1, and from the HCV1 clones described in E.P.O. Publication No. 318,216, and supra. A
comparison of the composite nucleotide sequence of each isolate derived from these clones is shown in Fig. 49. In
the figure, each sequence is shown 5' to 3' for the sense strand for the EnvL region, and the sequences have been
aligned. The vertical lines and capital letters indicate sequence homology; the absence of a line and an uncapitalized
letter indicate a lack of homology. The sequences shown in the lines are as follows: line 1, Thorn; line 2, EC1; line 3,
HCT #18; line 4, HCV1.

[0205] Sequence information on variants in the EnvR region was obtained from two clones of EC10, and from the
HCV1 clones described in E.P.O. Publication No. 318,216 and supra. The two EC10 clones differed by only one
nucleotide. A comparison of the nucleotide sequences of EC10 (clone 2) and a composite of the HCV1 sequences is
shown in Fig. 50; each sequence is shown 5' to 3' for the sense strand of the EnvR region, and the sequences have
been aligned. The double dots between the sequences indicate sequence homology.

[0206] A comparison of the amino acid sequences encoded in the EnvL (amino acids #117-308) and EnvR region 
(amino acids #300-438) for each of the isolates is shown in Fig. 51 and Fig. 52, respectively. Included in the Figures 
are sequences for the isolates JH23 and JH27, described supra. Also indicated are sequences from a Japanese isolate; 
these sequences were provided by Dr. T. Miyamura, Japan. In the figures, the amino acid sequence for the region is 
given in its entirety for HCV1, and the non-homologous amino acids in the various isolates are indicated. 

[0207] As seen in Fig. 51, in the EnvL region there is overall about a 93% homology between HCV1 and the other
isolates. HCT18, Th, and EC1 have about a 97% homology with HCV1; JH23 and JH27 have about 96% and about
95% homology, respectively, with HCV1. Fig. 52 shows that the homologies in the EnvR region are significantly less
than in the EnvL region; moreover, one subregion appears to be hypervariable (i.e., from amino acid 383-405). These
data are summarized in the Table immediately below.


Table

Homology of EnvR Region

                 Percent Homology with HCV1
Isolate          AA330-AA438    AA383-AA405

JH23 (U.S.)      83             57
JH27 (U.S.)      80             39
Japanese         73             48
EC10 (Italy)     84             48
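
For illustration, a percent homology figure of the kind tabulated above can be computed from two aligned sequences of equal length as the fraction of positions carrying the same residue. The following is a minimal sketch under that assumption; it is not necessarily the procedure used to prepare the table, and the two short sequences are hypothetical placeholders rather than HCV sequences.

```python
def percent_identity(reference, other):
    """Percentage of aligned positions at which two equal-length sequences agree."""
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(reference, other) if a == b)
    return 100.0 * matches / len(reference)


# Hypothetical 10-residue example: two of ten positions differ.
reference_fragment = "MKTAYIAKQR"
variant_fragment = "MKTAYLAKQK"
print(percent_identity(reference_fragment, variant_fragment))  # 80.0
```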


Detection of Positive and Negative Strand 5'-HCV RNA in Serum 

[0208] The RNA in HCV27, isolated from serum, was analyzed for the presence of positive and negative strands
using the PCR method. The PCR method was performed essentially as described above, except for the following. The
extracted HCV27 RNA was reverse transcribed into single-stranded cDNA using as a primer either Alex90 or JH52
(see supra, for the sequences). The sequence of Alex90 matches that in nucleotides -312 to -283 of the positive strand
of HCV RNA, whereas JH52 matches that of nucleotides -117 to -93 of the negative strand. The resulting single-stranded
HCV cDNAs were each separately amplified by PCR using Alex90 and JH52. Detection of the amplified products was
accomplished by Southern blotting, using Alex89 as the probe. Alex89 matches nucleotide numbers -203 to -175 of
HCV RNA. The sequence of Alex89 is:

5' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3'.

The analysis indicated that, by this method, the signals of the amplified products of both RNA strands were of equal
intensity. These results suggest that HCV RNA in the 5'-region may exist as double-stranded RNA.

Detection of HCV Polynucleotide Sequences Using PCR Amplification

[0209] In the generalized method for amplification of HCV RNA by cPCR it is contemplated that the RNA strand is
a virion or mRNA strand, which is a "sense" strand. However, it is also possible that replicative intermediate forms may
also be detected which would be "anti-sense"; in this case the primer would be "sense". An RNA sense strand containing
the target region is hybridized with an anti-sense primer which primes the synthesis of the replicate strand containing
the target. cDNA to the RNA template is synthesized with a primer- and template-dependent reverse transcriptase.
The cDNA in the resulting RNA:DNA hybrid is released by denaturation and treatment with RNAse. Primers are
annealed to the cDNA, and extended with a primer- and template-dependent DNA polymerase. The products are
denatured, re-annealed to primers, and a second round of synthesis is conducted. A number of cycles are run until
the amplified product containing the target region is in a desired amount, which is at least a detectable level.
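
The relationship between the number of cycles and the amount of amplified product can be sketched as follows. This is an idealized model only: it assumes that the number of target copies is multiplied by (1 + efficiency) in every cycle, with an efficiency of 1.0 corresponding to perfect doubling, whereas real reactions plateau and run at lower efficiency. The function name, the efficiency parameter, and the copy numbers are hypothetical.

```python
def cycles_to_reach(start_copies, target_copies, efficiency=1.0, max_cycles=100):
    """Count the cycles needed for start_copies to grow to at least target_copies.

    Each cycle multiplies the copy number by (1 + efficiency); efficiency=1.0
    is ideal doubling (assumption). Stops at max_cycles if the target is not reached.
    """
    copies = float(start_copies)
    cycles = 0
    while copies < target_copies and cycles < max_cycles:
        copies *= 1.0 + efficiency
        cycles += 1
    return cycles


# Under ideal doubling, a single template needs 20 cycles to exceed one million copies.
print(cycles_to_reach(1, 1_000_000))  # 20
```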

Detection of Amplified HCV Nucleic Acid Sequences derived from HCV Nucleic Acid Sequences in Liver and Plasma 
Specimens from Chimpanzees with NANBH 

[0210] HCV nucleic acids present in liver and plasma of chimpanzees with NANBH, and not in control chimpanzees,
were amplified using essentially the polymerase chain reaction (PCR) technique described by Saiki et al. (1986). The
primer oligonucleotides were derived from the HCV cDNA sequences in clone 81 (Fig. 22), or clones 36 (Fig. 23) and
37b (Fig. 24). The amplified sequences were detected by gel electrophoresis and a modified Southern blotting method,
using as probes the appropriate cDNA oligomer or nick-translated cDNA sequence with a sequence from the region
between, but not including, the two primers.

[0211] Samples of RNA containing HCV sequences to be examined by the amplification system were isolated from
liver biopsies of three chimpanzees with NANBH, and from two control chimpanzees. The isolation of the poly A+ RNA
fraction was by the guanidinium thiocyanate procedure described in Maniatis et al. (1982).

[0212] Samples of RNA which were to be examined by the amplification system were also isolated from the plasmas
of two chimpanzees with NANBH, and from one control chimpanzee, as well as from a pool of plasmas from control
chimpanzees. One infected chimpanzee had a titer equal to or greater than 10^6 CID/ml, and the other infected
chimpanzee had a titer equal to or greater than 10^5 CID/ml.

[0213] The nucleic acids were extracted from the plasma as follows. Either 0.1 ml or 0.01 ml of plasma was diluted
to a final volume of 1.0 ml, with a TENB/proteinase K/SDS solution (0.05 M Tris-HCl, pH 8.0, 0.001 M EDTA, 0.1 M
NaCl, 1 mg/ml Proteinase K, and 0.5% SDS) containing 10 micrograms/ml polyadenylic acid, and incubated at 37°C
for 60 minutes. After this proteinase K digestion, the resultant plasma fractions were deproteinized by extraction with
TE (10.0 mM Tris-HCl, pH 8.0, 1 mM EDTA) saturated phenol. The phenol phase was separated by centrifugation, and
was reextracted with TENB containing 0.1% SDS. The resulting aqueous phases from each extraction were pooled,
and extracted twice with an equal volume of phenol/chloroform/isoamyl alcohol [1:1(99:1)], and then twice with an
equal volume of a 99:1 mixture of chloroform/isoamyl alcohol. Following phase separation by centrifugation, the aqueous
phase was brought to a final concentration of 0.2 M Na Acetate, and the nucleic acids were precipitated by the
addition of two volumes of ethanol. The precipitated nucleic acids were recovered by ultracentrifugation in a SW 41
rotor at 38 K, for 60 minutes at 4°C.

[0214] In addition to the above, the high titer chimpanzee plasma and the pooled control plasma alternatively were
extracted with 50 micrograms of poly A carrier by the procedure of Chomczynski and Sacchi (1987). This procedure
uses an acid guanidinium thiocyanate extraction. RNA was recovered by centrifugation at 10,000 RPM for 10 minutes
at 4°C in an Eppendorf microfuge.

[0215] On two occasions, prior to the synthesis of cDNA in the PCR reaction, the nucleic acids extracted from plasma
by the proteinase K/SDS/phenol method were further purified by binding to and elution from S and S Elutip-R Columns.
The procedure followed was according to the manufacturer's directions.

[0216] The cDNA used as a template for the PCR reaction was derived from the nucleic acids (either total nucleic
acids or RNA) prepared as described above. Following ethanol precipitation, the precipitated nucleic acids were dried,
and resuspended in DEPC treated distilled water. Secondary structures in the nucleic acids were disrupted by heating
at 65°C for 10 minutes, and the samples were immediately cooled on ice. cDNA was synthesized using 1 to 3 micrograms
of total chimpanzee RNA from liver, or from nucleic acids (or RNA) extracted from 10 to 100 microliters of plasma.
The synthesis utilized reverse transcriptase, and was in a 25 microliter reaction, using the protocol specified by the
manufacturer, BRL. The primers for cDNA synthesis were those also utilized in the PCR reaction, described below. All
reaction mixtures for cDNA synthesis contained 23 units of the RNAase inhibitor, RNASIN™ (Fisher/Promega). Following
cDNA synthesis, the reaction mixtures were diluted with water, boiled for 10 minutes, and quickly chilled on ice.

[0217] The PCR reactions were performed essentially according to the manufacturer's directions (Cetus-Perkin-Elmer),
except for the addition of 1 microgram of RNase A. The reactions were carried out in a final volume of 100
microliters. The PCR was performed for 35 cycles, utilizing a regimen of 37°C (2 min), 72°C (3 min), and 94°C (1 min).

[0218] The primers for cDNA synthesis and for the PCR reactions were derived from the HCV cDNA sequences in
either clone 81, clone 36, or clone 37b. (The HCV cDNA sequences of clones 81, 36, and 37b are shown in Figs. 22,
23, and 24, respectively.) The sequences of the two 16-mer primers derived from clone 81 were:

5' CAA TCA TAC CTG ACA G 3'

and

5' GAT AAC CTC TGC CTG A 3'.

The sequence of the primer from clone 36 was:

5' GCA TGT CAT GAT GTA T 3'.

The sequence of the primer from clone 37b was:

5' ACA ATA CGT GTG TCA C 3'.

In the PCR reactions, the primer pairs consisted of either the two 16-mers derived from clone 81, or the 16-mer from
clone 36 and the 16-mer from clone 37b.
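
As a rough guide to how short primers such as these 16-mers behave at a given annealing temperature, the following minimal sketch applies the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C) in °C. This rule is not part of the procedure described here; it is a common approximation for short oligonucleotides, it ignores salt and primer concentration, and the values it gives should be treated as estimates only. The dictionary keys are hypothetical labels for the primers listed above.

```python
def wallace_tm(sequence):
    """Approximate melting temperature (°C) of a short oligonucleotide by the Wallace rule."""
    seq = sequence.replace(" ", "").upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return 2 * at + 4 * gc


# The four 16-mer primers listed in the text (5' to 3'); keys are illustrative labels.
PRIMERS = {
    "clone 81, first": "CAA TCA TAC CTG ACA G",
    "clone 81, second": "GAT AAC CTC TGC CTG A",
    "clone 36": "GCA TGT CAT GAT GTA T",
    "clone 37b": "ACA ATA CGT GTG TCA C",
}

estimates = {name: wallace_tm(seq) for name, seq in PRIMERS.items()}
for name, tm in estimates.items():
    print(f"{name}: approximately {tm} °C")
```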

[0219] The PCR reaction products were analyzed by separation of the products by alkaline gel electrophoresis,
followed by Southern blotting, and detection of the amplified HCV-cDNA sequences with a 32P-labeled internal
oligonucleotide probe derived from a region of the HCV cDNA which does not overlap the primers. The PCR reaction
mixtures were extracted with phenol/chloroform, and the nucleic acids precipitated from the aqueous phase with salt
and ethanol. The precipitated nucleic acids were collected by centrifugation, and dissolved in distilled water. Aliquots
of the samples were subjected to electrophoresis on 1.8% alkaline agarose gels. Single stranded DNA of 60, 108, and
161 nucleotide lengths were co-electrophoresed on the gels as molecular weight markers. After electrophoresis, the
DNAs in the gel were transferred onto Biorad Zeta Probe™ paper. Prehybridization and hybridization, and wash
conditions were those specified by the manufacturer (Biorad).

[0220] The probes used for the hybridization-detection of amplified HCV cDNA sequences were the following. When
the pair of PCR primers were derived from clone 81, the probe was a 108-mer with a sequence corresponding to that
which is located in the region between the sequences of the two primers. When the pair of PCR primers were derived
from clones 36 and 37b, the probe was the nick-translated HCV cDNA insert derived from clone 35, the nucleotide
sequence of which is shown in Fig. 34. The primers are derived from nucleotides 155-170 of the clone 37b insert, and
206-268 of the clone 36 insert. The 3'-end of the HCV cDNA insert in clone 35 overlaps nucleotides 1-186 of the insert
in clone 36; and the 5'-end of clone 35 insert overlaps nucleotides 207-269 of the insert in clone 37b. (Compare Figs.
23, 34 and 24.) Thus, the cDNA insert in clone 35 spans part of the region between the sequences of the clone 36 and
37b derived primers, and is useful as a probe for the amplified sequences which include these primers.

[0221] Analysis of the RNA from the liver specimens was according to the above procedure utilizing both sets of
primers and probes. The RNA from the liver of the three chimpanzees with NANBH yielded positive hybridization results
for amplification sequences of the expected size (161 and 586 nucleotides for 81 and 36 and 37b, respectively), while
the control chimpanzees yielded negative hybridization results. The same results were achieved when the experiment
was repeated three times.

[0222] Analysis of the nucleic acids and RNA from plasma was also according to the above procedure utilizing the
primers and probe from clone 81. The plasmas were from two chimpanzees with NANBH, from a control chimpanzee,
and pooled plasmas from control chimpanzees. Both of the NANBH plasmas contained nucleic acids/RNA which yielded
positive results in the PCR amplified assay, while both of the control plasmas yielded negative results. These results
have been obtained repeatedly.

[0223] Defective viruses have been known to occur in RNA viruses. By using PCR technology it is possible to design
primers to amplify sequences of the HCV genome. By analysis of the amplified products, it is expected that one will be
able to identify both defective versions of the viral genome as well as wild-type viral species. Accordingly, using two primers
based on known HCV sequence, one can predict accurately the expected size of the PCR product. Any larger species
observed by gel electrophoresis and hybridization analysis could represent potential variant genomes. Alternatively,
any smaller species observed in this fashion might represent defective agents. Analyses of these types would be useful
in confirming the exact origin of the known HCV sequence, whether it is indeed a wild-type viral sequence or a defective
genome. Techniques and methods for these analyses are well known in the art and have been previously described.
This methodology will enable one skilled in the art to obtain related (wild-type or defective) forms of the viral genome.
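
The size-based interpretation described in the preceding paragraph can be sketched as a simple comparison of the observed product size with the size predicted from the two primers. The tolerance value below is a hypothetical allowance for the resolution of gel sizing and is not taken from the text; the function name is likewise illustrative.

```python
def classify_product(observed_nt, expected_nt, tolerance_nt=10):
    """Interpret a PCR product by comparing its observed size with the predicted size.

    tolerance_nt is an assumed allowance for gel sizing error (hypothetical value).
    """
    if abs(observed_nt - expected_nt) <= tolerance_nt:
        return "expected size"
    if observed_nt > expected_nt:
        return "larger species: potential variant genome"
    return "smaller species: potential defective genome"


# Example using the 586 nucleotide product expected for the clone 36 and 37b primer pair.
for observed in (586, 700, 450):
    print(observed, classify_product(observed, 586))
```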

Detection of Sequences in Captured Particles Which When Amplified by PCR Hybridize to HCV cDNA Derived from 
Clone 81 

[0224] The RNA in captured particles was obtained as described below. The analysis for sequences which hybridize
to the HCV cDNA derived from clone 81 was carried out utilizing the PCR amplification procedure, as described supra,
except that the hybridization probe was a kinased oligonucleotide derived from the clone 81 cDNA sequence. The
results showed that the amplified sequences hybridized with the HCV cDNA probe.

[0225] Particles were captured from HCV infected chimpanzee plasma using polystyrene beads coated with an
immunopurified antibody directed against the polypeptide encoded in clone 5-1-1. The procedure for producing the
immunopurified antibody preparation is described in E.P.O. Publication No. 318,216, which is commonly owned by the
herein assignee, and which is incorporated herein by reference. Briefly, the HCV polypeptide encoded within clone
5-1-1 was expressed as a fusion polypeptide with superoxide dismutase (SOD). This was accomplished by subcloning
the clone 5-1-1 cDNA insert into the expression vector pSODcf1 (Steimer et al. (1986)). DNA isolated from pSODcf1
was treated with BamHI and EcoRI, and the following linker was ligated into the linear DNA created by the restriction
enzymes:

5' GAT CCT GGA ATT CTG ATA AGA 
CCT TAA GAC TAT TTT AA 3'

After cloning, the plasmid containing the insert was isolated. Plasmid containing the insert was restricted with EcoRI.
The HCV cDNA insert in clone 5-1-1 was excised with EcoRI, and ligated into this EcoRI linearized plasmid DNA. The
DNA mixture was used to transform E. coli strain D1210 (Sadler et al. (1980)). Recombinants with the 5-1-1 cDNA in
the correct orientation for expression of the ORF were identified by restriction mapping and nucleotide sequencing.
Recombinant bacteria from one clone were induced to express the SOD-NANB5-1-1 polypeptide by growing the bacteria
in the presence of IPTG. The fusion polypeptide was purified from the recombinant E. coli by differential extraction of
the cell extracts with urea, followed by chromatography on anion and cation exchange columns. The purified
SOD-NANB5-1-1 polypeptide was attached to a nitrocellulose membrane. Antibody in samples of HCV infected serum
was absorbed to the matrix-bound polypeptide. After washing to remove non-specifically bound materials and unbound
materials, the bound antibody was released from the bound polypeptide.

cPCR Method to Detect HCV RNA in Liver and in Serum from Individuals with NANBH

[0226] The reliability and utility of a modified form of the PCR assay, i.e., a cPCR assay, for detecting HCV infection
was determined by performing the assay on total liver RNA and on serum from infected individuals. In the cPCR assay,
putative viral RNA in the sample is reverse transcribed into cDNA with reverse transcriptase; a segment of the resulting
cDNA is then amplified utilizing a modified version of the PCR technique described by Saiki et al. (1986). The primers
for the cPCR technique are derived from HCV RNA, which can be identified by the family of HCV cDNAs provided
herein. Amplified product corresponding to the HCV-RNA is detected utilizing a probe derived from the family of HCV
cDNAs provided herein.

[0227] The cPCR/HCV assay used in these studies was performed utilizing the following methods for the preparation
of RNA, the reverse transcription of the RNA into cDNA, the amplification of specific segments of the cDNA by PCR,
and the analysis of the PCR products.

[0228] RNA was extracted from liver utilizing the guanidinium isothiocyanate method for preparing total RNA described
in Maniatis et al. (1982).

[0229] In order to isolate total RNA from plasma, the plasma was diluted five- to ten-fold with TENB (0.1 M NaCl, 50
mM Tris-HCl, pH 8.0, 1 mM EDTA) and incubated in a Proteinase K/SDS solution (0.5% SDS, 1 mg/ml Proteinase K,
20 micrograms/ml Poly A carrier) for 60 to 90 minutes at 37°C. The samples were extracted once with phenol (pH 6.5),
the resulting organic phase was re-extracted once with TENB containing 0.1% SDS, and the aqueous phases of both
extractions were pooled and extracted twice with an equal volume of phenol/CHCl3/isoamyl alcohol [1:1(99:1)]. The
resulting aqueous phases were extracted with an equal volume of CHCl3/isoamyl alcohol (99:1) twice, and ethanol
precipitated using 0.2 M sodium acetate, pH 6.5, and 2.5 volumes of 100% ethanol; precipitation was overnight at
-20°C.

[0230] The cDNA used as a template for the PCR reaction was prepared utilizing the designated samples for preparation
of the corresponding cDNAs. Each RNA sample (containing either 2 micrograms of heat denatured total chimpanzee
liver RNA, RNA from 2 microliters of plasma, or 10% of the RNA extracted from 10 mm X 4 mm cylindrical
human liver biopsies) was incubated in a 25 microliter reaction containing 1 micromolar of each primer, 1 millimolar of
each deoxyribonucleotide triphosphate (dNTP), 50 millimolar Tris-HCl, pH 8.3, 5 millimolar MgCl2, 5 millimolar
dithiothreitol (DTT), 73 millimolar KCl, 40 units of RNase inhibitor (RNASIN), and 5 units of AMV reverse transcriptase.
The incubation was for 60 minutes at 37°C. Following cDNA synthesis, the reactions were diluted with 50 microliters
of deionized water (DIW), boiled for 10 minutes, and cooled on ice.
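
The amounts implied by the concentrations in the 25 microliter reverse transcription reaction, and the effect of the subsequent dilution with 50 microliters of water, can be worked out as follows. This is a minimal sketch of the unit arithmetic only (micromolar times microliters gives picomoles); the function names are hypothetical.

```python
def amount_pmol(concentration_uM, volume_ul):
    """Amount of solute in picomoles: micromolar x microliters = picomoles."""
    return concentration_uM * volume_ul


def diluted_concentration(concentration, initial_volume_ul, added_volume_ul):
    """Concentration after adding diluent, in the same units as the input concentration."""
    return concentration * initial_volume_ul / (initial_volume_ul + added_volume_ul)


REACTION_UL = 25.0   # reverse transcription reaction volume from the text
WATER_UL = 50.0      # deionized water added after cDNA synthesis

primer_pmol = amount_pmol(1.0, REACTION_UL)              # 1 micromolar primer
dntp_nmol = amount_pmol(1000.0, REACTION_UL) / 1000.0    # 1 millimolar of each dNTP
primer_after_dilution_uM = diluted_concentration(1.0, REACTION_UL, WATER_UL)

print(f"Each primer: {primer_pmol:.0f} pmol")
print(f"Each dNTP: {dntp_nmol:.0f} nmol")
print(f"Primer after dilution: {primer_after_dilution_uM:.2f} micromolar")
```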

[0231] Amplification of a segment of the HCV cDNA was performed utilizing two synthetic oligomer 16-mer primers
whose sequences were derived from HCV cDNA clones 36 (anti-sense) and 37b (sense). The sequence of the primer
from clone 36 was:

5' GCA TGT CAT GAT GTA T 3'.

The sequence of the primer from clone 37b was:

5' ACA ATA CGT GTG TCA C 3'.

The primers were used at a final concentration of 1 micromolar each. In order to amplify the segment of HCV cDNA
which is flanked by the primers, the cDNA samples were incubated with 0.1 microgram of RNAse A and the PCR
reactants of the Perkin Elmer Cetus PCR kit (N801-0043 or N801-0055) according to the manufacturer's instructions.
The PCR reaction was performed for either 30 cycles or 60 cycles in a Perkin Elmer Cetus DNA thermal cycler. Each
cycle consisted of a 1 minute denaturation step at 94°C, an annealing step of 2 minutes at 37°C, and an extension
step of 3 minutes at 72°C. However, the extension step in the final cycle (30 or 60) was 7 minutes rather than 3 minutes.
After amplification the samples were extracted with an equal volume of phenol:chloroform (1:1), followed by extraction
with an equal volume of chloroform, and then the samples were precipitated with ethanol containing 0.2 M sodium
acetate.

[0232] The cPCR products were analyzed as follows. The products were subjected to electrophoresis on 1.8% alkaline
agarose gels according to Murakawa et al. (1988), and transferred onto Zeta Probe™ paper (BioRad Corp.) by
blotting gels overnight in 0.4 M NaOH. The blots were neutralized in 2 X SSC (1 X SSC contains 0.15 M NaCl, 0.015
M sodium citrate), prehybridized in 0.3 M NaCl, 15 mM sodium phosphate buffer, pH 6.8, 15 mM EDTA, 1.0% SDS,
0.5% nonfat milk (Carnation Co.), and 0.5 mg/ml sonicated denatured salmon sperm DNA. The blots to be analyzed
for HCV cDNA fragments were hybridized to a 32P-labeled probe generated by nick translation of the HCV cDNA insert
sequence in clone 35, described in E.P.O. Publication No. 318,216. After hybridization, the blots were washed in 0.1
X SSC (1 X SSC contains 0.15 M NaCl, 0.015 M Na citrate) at 65°C, dried, and autoradiographed. The expected product
size is 586 nucleotides in length; products which hybridized with the probe and migrated in the gels in this size range
were scored as positive for viral RNA.

[0233] As a control, cPCR using primers designed to amplify alpha-1 antitrypsin mRNA was performed to verify the
presence of RNA in each sample analyzed. The coding region of the alpha-1 antitrypsin gene is described in Rosenberg
et al. (1984). Synthetic oligomer 16-mer primers designed to amplify a 365 nucleotide fragment of the coding region
of the alpha-1 antitrypsin gene were derived from nucleotides 22-37 (sense) and nucleotides 372-387 (antisense). The
PCR products were detected using a 32P nick-translated probe which lies between, and does not include, the cDNA/PCR
primer sequences.

[0234] Due to the extreme sensitivity of the PCR reaction, all samples were run a minimum of three times. All false
positive signals were eliminated when the following precautions were taken: 1) eliminating aerosols by using screw
capped tubes with rubber O-ring seals; 2) pipetting with Rainin Microman™ positive displacement pipetters with
disposable pistons/capillaries; and 3) selecting the oligonucleotide sequences for the cDNA and PCR primers from two
non-contiguous cDNA clones.

Detection of HCV RNA in Liver Samples by a cPCR Method 

[0235] The cPCR assay was performed on total RNA isolated from livers of three chimpanzees experimentally
infected with a NANBH agent, and from liver biopsies of Italian patients diagnosed as having chronic NANBH.
[0236] Fig. 25A shows the results of the cPCR assay using 1 microgram of each preparation of total liver RNA. The
RNA was isolated from liver samples of a chimpanzee in the chronic phase of NANBH (910) (lane 1), and two chimpanzees
in the acute phase of infection (1028 and 508) (lanes 2 and 3, respectively). PCR was performed on the samples in
lanes 1-3 for 30 cycles and the autoradiogram of the blot containing those lanes was exposed for 5 hours. cDNA from
1 microgram of total RNA from acutely infected animal 1028 (lane 4), and three uninfected chimpanzees (lanes 5-7),
were amplified for 60 cycles and the autoradiograms containing those lanes were exposed for 7 days. 32P-labeled
MspI-digested pBR322 DNA served as markers on all the autoradiograms. It may be seen from the results that cDNA
corresponding to HCV RNA was seen only in the samples from chimpanzees with NANBH, whether acute or chronic
(lanes 1, 3, and 4). The cPCR products in these lanes migrated between marker fragments of 527 and 622 nucleotides
(not shown).

[0237] Fig. 25B shows the results of the cPCR assay using 10% of the RNA extracted from 10 mm X 4 mm liver biopsy
cylinders from 15 chronic NANB patients (lanes 1-15), one patient with cryptogenic liver disease (lane 16) and one
control sample from a patient with chronic Hepatitis B (lane 17). Amplification by PCR was for 30 cycles and the
autoradiograms of the blots were exposed for 4 days, except that lane 1 was exposed for 15 hours. As seen from the
results, 9/15 (60%) of the human samples were positive for HCV RNA (lanes 1, 2, 4, 6, 7, 10-13). One patient diagnosed
with cryptogenic liver disease (lane 16) and one patient with a chronic HBV infection (lane 17) were repeatedly negative
in the cPCR assay.

Comparison of the HCV/cPCR Assay on Human Liver Biopsies and RIA of Serum Using HCV C100-3 Polypeptide

[0238] SOD/HCV C100-3 polypeptide (also called C100) is a recombinant fusion polypeptide which contains 363 
viral amino acids. The polypeptide is useful for detecting antibodies to HCV (See Kuo et al. (1989)). The method for 
preparing C100 is described in E.P.O. Publication No. 318,216. 

[0239] Radioimmune assay using C100 was performed on the sera collected from the same 17 human patients
whose liver samples were subjected to HCV/cPCR assay as described supra. The sera were collected on the same
day as the liver biopsies. The assay was performed essentially as described in E.P.O. Publication No. 318,216, which
is commonly owned and incorporated herein by reference. Briefly, microtiter plates (Immulon 2, Removawell strips)
were coated with 0.1 microgram of purified C100. The coated plates were incubated for 1 hour at 37°C with the serum
samples (100 microliters of a 1:100 dilution) or appropriate controls. After incubation, the unbound material was
removed, the plates were washed, and complexes of human antibody-C100 were detected by incubation with 125I-labeled
sheep anti-human immunoglobulin. Unbound labeled antibody was removed by aspiration, and the plates were washed.
The radioactivity in individual wells was determined.

[0240] The results of the RIA showed that sixty-seven percent of these samples were positive for anti-C100 antibodies.
Serum from the patient diagnosed with cryptogenic liver disease was positive for anti-C100 antibodies, although the
levels of viral RNA were undetectable in the patient's liver in this sample. The level of correlation between the presence
of anti-C100 antibodies and HCV RNA was seventy percent; two patients who were negative for antibodies by RIA
had significant levels of HCV RNA in their livers (data not shown).

[0241] The results indicate that virus is frequently present in the liver of patients with circulating anti-C100 antibodies,
and confirm claims that the presence of anti-C100 antibodies accurately reflects exposure to HCV. Moreover, taken
together, these results indicate that HCV of this type accounts for NANBH in at least 75% of the patients in this study,
and that the predominant strain of HCV in Italy appears to be closely related to the strain of HCV prevalent in the United
States.

HCV/cPCR Assay of Sera: Detection of Viral RNA in Acute Phase Infection in Chimpanzees

[0242] The temporal relationship between the display of liver damage, the presence of HCV RNA, and the presence
of anti-HCV antibodies was monitored in serum from two experimentally infected chimpanzees with NANBH (nos. 771
and 910). Liver damage was determined by alanine aminotransferase (ALT) levels; the presence of HCV RNA was
determined by the HCV cPCR assay described above; anti-HCV antibodies were detected utilizing the C100 RIA.
[0243] The HCV/cPCR analysis was performed on RNA extracted from 1 microliter of chimpanzee plasma. Serum
was taken from chimpanzee 771 on days 25, 32, 70 and 88 post-infection; cPCR was performed for 30 cycles and the
autoradiogram was exposed for 18 days. Serum was taken from chimpanzee 910 on days 11, 28, and 67 post-infection;
cPCR was performed for 60 cycles and the autoradiogram was exposed for 5 days.

[0244] The results of the assays are shown in Fig. 26A for chimpanzee 771 and Fig. 26B for chimpanzee 910. From
a comparison of Figs. 26A and 26B, it appears that an early, well defined peak of ALT values during acute hepatitis
correlates with the presence of viral RNA in the infected individual.

[0245] The data also indicate that the presence of HCV RNA, which is indicative of a state of viremia, precedes the
presence of anti-HCV antibodies. Chimpanzee 771 (Fig. 26A) exhibited a clearly defined acute episode of
post-transfusion NANBH at 28 days, as characterized by an initial peak of ALT levels. HCV RNA was detected in the serum
collected at day 25, and at day 32. However, during this acute phase, anti-HCV antibodies were absent. In contrast,
at day 70 HCV RNA was below the experimental level of detection, and anti-HCV antibodies were rising. At day 88,
HCV RNA remained undetectable, while anti-HCV antibodies were significantly increased over that of day 70.

[0246] The results obtained from the sera of chimpanzee 910 were somewhat similar in pattern, although
HCV antibodies induced by the infection were not detected during the acute phase of the disease, which extended
to at least day 67; the anti-HCV antibodies detected by RIA at day 11 were due to passive immunization of animal 910
with antibodies from the plasma used to inoculate the animal. Anti-HCV antibodies were found in chimpanzee 910
serum during the later, chronic phase of the infection (data not shown).

[0247] It should be noted that low ALT values in plasma from individuals with chronic NANBH do not necessarily
correlate with weak virus production. A pool of 17 different plasma samples taken from chimpanzee 910 over a period
of two to three and one-half years post inoculation was monitored for ALT levels and for HCV RNA. The ALT values
of the samples did not exceed 45 mU/ml; nevertheless, titration studies indicated high titers of HCV (3 x 10^6 CID/ml).
cPCR was carried out for 30 cycles, and the autoradiogram was exposed for 15 hours; the cPCR analysis clearly
showed the presence of viral RNA (data not shown).

HCV/cPCR Assay of Sera: Detection of Viral RNA in Acute Phase Infection in Humans 

[0248] Plasma from a human surgical patient collected during early acute NANBH was examined for HCV RNA and
for anti-HCV antibodies, utilizing the HCV/cPCR assay and C100-RIA, respectively. The HCV/cPCR assay was
conducted utilizing 1 microliter of plasma from the patient, and from four human controls with known pedigrees; cPCR was
performed for thirty cycles, and after hybridization and washing the autoradiogram was exposed for eight hours.
[0249] The results showed that the serum collected from the surgical patient during the acute phase of infection
contained a high level of viral RNA, and that anti-HCV antibodies were not detectable by the C100-RIA (data not
shown). (The acute phase plasma from the surgical patient was known to have a high titer of NANBH infectious agent
[10^6.5 CID/ml, as determined by Feinstone et al. (1981); Feinstone et al. (1983)]). It should be noted, however, that
this patient did seroconvert to anti-HCV antibodies by the C100-RIA approximately 9 months after infection. The
pedigreed human control plasmas were negative in both the HCV/cPCR assay and C100-RIA.


Sensitivity of HCV/cPCR Assay 

[0250] The sensitivity of the HCV/cPCR assay was determined by analyzing ten-fold serial dilutions of a plasma pool
of known titer. The chimpanzee plasma had a titer of ~3 x 10^5 CID/ml, and RNA was extracted from ten-fold dilutions
of 1 microliter of the plasma. cPCR was performed for 30 cycles, and after hybridization and washing, the autoradiogram
was exposed for 15 hours. The cPCR products resulting from amplification of ~300, ~30, and ~3 CID of HCV genomes
are shown in lanes 1-3, respectively, of Fig. 29. The samples in lanes 1 and 2 were detectable on autoradiograms
exposed for 2 hours.

[0251] Since the average titer of HCV in infected individuals is believed to be between approximately 100 and 10,000
CID/ml of plasma, these data suggest that the HCV/cPCR assay may be clinically useful.
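
The arithmetic behind the dilution series can be sketched as follows. This is a minimal illustration only, assuming that the number of infectious doses (CID) in a reaction scales linearly with titer, input volume, and dilution; the function name `cid_per_reaction` is hypothetical and is not part of the assay described above.

```python
def cid_per_reaction(titer_cid_per_ml, volume_ul, dilution_factor=1.0):
    """Estimate the infectious doses (CID) present in one extraction.

    Assumes simple proportionality: titer (CID/ml) x volume (ml) / dilution.
    """
    return titer_cid_per_ml * volume_ul / 1000.0 / dilution_factor


# Plasma pool of paragraph [0250]: ~3 x 10^5 CID/ml, 1 microliter extracted,
# analyzed as ten-fold serial dilutions (undiluted, 1:10, 1:100).
TITER = 3e5
series = [cid_per_reaction(TITER, 1.0, 10.0 ** step) for step in range(3)]
print(series)  # [300.0, 30.0, 3.0]
```

Under these assumptions the three values correspond to the approximate inputs reported for lanes 1-3 of Fig. 29.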

HCV/cPCR Assay for Variant HCV Strains 

[0252] Primers, consisting of a set of oligomer 44-mers and a set of oligomer 45-mers, were designed to amplify
strains of HCV which are similar or identical to the HCV isolate from which the cDNA sequence in Fig. 18 is derived.
The premise underlying the design of these primers is our discovery that HCV is a Flavi-like virus. Members of the
Flaviviridae family, when compared to HCV, have two major conserved sets of amino acid sequences, TATPPG and
QRRGR, in the putative NS3 region of these viruses. Several other smaller sets may be seen, for example, GDD in
the putative NS5 region. Other sets are determinable by comparison of the known amino acid sequences with that of
HCV. This information was deduced from the sequences for several members of Flaviviridae which have been
described, including Japanese Encephalitis Virus (Sumiyoshi et al. (1987)), Yellow Fever Virus (Rice et al. (1985)), Dengue
Type 2 Virus (Hahn et al. (1988)), Dengue Type 4 Virus (Mackow (1987)), and West Nile Virus (Castle et al. (1986)).
The conserved amino acid sequences and codon utilization are in the table immediately following.


Conserved Amino Acid (A.A.) Sequences Among Flaviviruses and HCV

                # of
Virus           first A.A.         T    A    T    P    P    G

HCV             1348          5'   ACC  GCC  ACC  CCT  CCG  GCC  3'
Yellow Fever    1805               ACA  GCC  ACA  CCG  CCT  GGG
West Nile       1818               ACG  GCA  ACG  CCA  CCC  GGG
Dengue-4        1788               ACC  GCA  ACC  CCT  CCC  GGA
JEV             1957               ACA  GCG  ACC  CCG  CCT  GGA

HCV sense primer (44mer) =    5'   ACC  GCC  ACC  CCX  CC   3'
                                   (X = A, T, C, or G)

                # of
Virus           first A.A.         Q    R    R    G    R

HCV             1486          5'   CAA  CGT  CGG  GGC  AGG  3'
Yellow Fever    1946               CAA  AGG  AGG  GGG  CGC
West Nile       1959               CAG  CGG  AGA  GGA  CGC
Dengue-4        1929               CAG  AGA  AGA  GGG  CGA
JEV             1820               CAA  CGG  AGG  GGC  AGA

HCV antisense primer (45mer) = 3'  GTX  GCA  GCC  CCG  TCC  5'
                                   (X = T or C)

Note: the primer sequence was chosen to minimize the
number of nucleotide degeneracies at the 3'-end of the
primer sequence and to maximize the number of nucleotides
at the 3'-end of each primer which exactly match any of
the possible nucleotide sequences, or the complement
thereof, encoding the conserved amino acids indicated
above.
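
As an illustration of how the QRRGR rows of the table relate to the antisense primer, the short sketch below translates each listed codon set and takes the reverse complement of the HCV codons. It is a minimal check, not part of the described method: the codon dictionary holds only the standard genetic code entries needed for these rows, and the helper names `translate` and `reverse_complement` are hypothetical.

```python
# Standard genetic code entries for Gln (Q), Arg (R) and Gly (G) only.
CODON_TO_AA = {
    "CAA": "Q", "CAG": "Q",
    "CGT": "R", "CGC": "R", "CGA": "R", "CGG": "R", "AGA": "R", "AGG": "R",
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
}

# Codons as listed in the QRRGR part of the table.
QRRGR_CODONS = {
    "HCV": "CAA CGT CGG GGC AGG",
    "Yellow Fever": "CAA AGG AGG GGG CGC",
    "West Nile": "CAG CGG AGA GGA CGC",
    "Dengue-4": "CAG AGA AGA GGG CGA",
    "JEV": "CAA CGG AGG GGC AGA",
}


def translate(codons):
    """Translate a space-separated codon string into one-letter amino acids."""
    return "".join(CODON_TO_AA[c] for c in codons.split())


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


for virus, codons in QRRGR_CODONS.items():
    print(virus, translate(codons))

# The antisense primer anneals to the coding strand, so its 3' portion is the
# reverse complement of the HCV codons for QRRGR.
hcv_coding = QRRGR_CODONS["HCV"].replace(" ", "")
print(reverse_complement(hcv_coding))  # CCTGCCCCGACGTTG
```

Read 3' to 5', the printed sequence is GTT GCA GCC CCG TCC, which is the table's antisense primer with X = T; X = C corresponds to the alternative glutamine codon CAG.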


[0253] The 44-mer and 45-mer oligomer primers were designed so that the sequences encoding these amino acids 
were incorporated within the primer. Moreover, they contain degeneracies at the 3'-end of each primer, and are derived 
from two different regions of the HCV genome which are present in clone 40b (See Fig. 28), and which are derived 
from the region encoding putative NS3 of HCV. The formulae for the oligonucleotide primers in the sets are: 


5' GAC TGC GGG GGC GAG ACT GGT TGT GCT CGC
ACC GCC ACC CCX CC 3'


where X is A, T, G, or C; and

5' TCT GTA GAT GCC TGG CTT CCC CCT GCC AGT
CCT GCC CCG ACG YTG 3'

where Y is T or C.
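
The two formulae each describe a small set of individual oligomers. The sketch below expands the degenerate positions and checks the stated lengths; it is an illustration under stated assumptions, not the synthesis procedure used. The helper name `expand` is hypothetical, and the 3' portion of the 45-mer is written here as it appears in claim 7 (ACG YTG).

```python
from itertools import product


def expand(template, codes):
    """Expand a degenerate primer formula into its individual sequences.

    `codes` maps each degenerate symbol to the bases it may stand for.
    """
    seq = template.replace(" ", "")
    choices = [codes.get(base, base) for base in seq]
    return ["".join(combo) for combo in product(*choices)]


SENSE_44 = "GAC TGC GGG GGC GAG ACT GGT TGT GCT CGC ACC GCC ACC CCX CC"
# 3' portion taken as in claim 7 (an assumption about the intended sequence).
ANTISENSE_45 = "TCT GTA GAT GCC TGG CTT CCC CCT GCC AGT CCT GCC CCG ACG YTG"

sense_set = expand(SENSE_44, {"X": "ATGC"})
antisense_set = expand(ANTISENSE_45, {"Y": "TC"})

print(len(sense_set), {len(s) for s in sense_set})          # 4 {44}
print(len(antisense_set), {len(s) for s in antisense_set})  # 2 {45}
```

Each set is small because only one position near the 3'-end of each primer is degenerate.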

[0254] The HCV/cPCR assay was carried out utilizing these primers to amplify HCV RNA in chimpanzee 910 plasma. 
The assay method was essentially as described in Section supra., except that the 44-mer and 45-mer sets of oligomer 
primers were substituted for the primers derived from clone 36 and clone 37b. In addition, detection of amplified HCV 
cDNA was by hybridization with a probe derived from clone 40a, the sequence of which is shown in Fig. 32. 
[0255] The probe was prepared by amplifying a segment of clone 40a utilizing the PCR method described supra., 
and 18-mer primers containing the following sequences: 


5' GAG ACA TCT CAT CTT CTG 3'

and


5' GAG CGT GAT TGT CTC AAT 3'.

After amplification, the probe preparation was labeled with 32P by nick translation.

[0256] Fig. 33 shows an autoradiograph of the Southern blots probed with the sequence derived from clone 40a.
32P-labeled MspI-digested pBR322 DNA fragments served as markers (lane 1). The predicted size of the PCR product
resulting from amplification using these primers is 490 nucleotides (nt). Duplicate reactions are shown in lanes 2 and 3.

Analysis for Variants of the 5'-Region of HCV

[0257] Based upon the Flavivirus model, the 5'-region HCV cDNA which is flanked by the regions represented in
clones ag30a and k9-1 encodes a segment of putative envelope and/or matrix protein(s) (E/M). Serum obtained from
the chimpanzee from which the HCV cDNA "c" library was constructed was analyzed by HCV/cPCR to determine
whether variants within this target region were present.

[0258] The HCV/cPCR assay was performed essentially as described supra., for the isolation of clone 5'-32, except
for the primers and probes used. Fig. 37 shows the relationship of the primers and probes (and the clones from which
they were derived) to that of the target region of HCV cDNA. One set of PCR primers, ag30a16A and K91Env16B,
were derived from clones ag30a and k9-1, which are upstream and downstream, respectively, of the target sequence.
The expected size of the cPCR product primed by ag30a16A and K91Env16B is 1.145 kb based upon the confirmed
sequence of HCV cDNA. Two other sets of PCR primers covering the region amplified using ag30a16A and K91Env16B,
and overlapping each other, were also used for PCR amplification of HCV RNA in the serum. Thus, in this case the
PCR reactions were run using as one set of primers ag30a16A and CA156e16B, and as the second set of primers
CA156e16A and K91Env16B. The expected PCR product sizes for these pairs were 615 nucleotides (NT) and 683 NT,
respectively. The table immediately following lists the primer, the clone from which it was derived, and the primer
sequence.

Primer       Clone    Sequence

ag30a16A     ag30a    5' CTC TAT GGC AAT GAG G 3'
K91Env16B    k9-1     5' CGT TGG CAT AAC TGA T 3'
CA156e16B    156      5' CGA CAA GAA AGA CAG A 3'
CA156e16A    156      5' AGC TTC GAC GTC ACA T 3'
CA216a16A    216      5' TGA ACT ATG CAA CAG G 3'
CA216a16B    216      5' GGA GTG TGC AGG ATG G 3'
CA84a16A     84       5' AAG GTT GCA ATT GCT C 3'
CA84a16B     84       5' ACT AAC AGG ACC TTC G 3'

The probes for all of the HCV/cPCR products consisted of 32P-labeled sections of HCV cDNA which had been prepared
by PCR amplification of a region of clone 216 (using CA216a16A and CA216a16B as primers), and of clone 84 (using
CA84a16A and CA84a16B as primers); 32P was introduced into the PCR products by nick translation. These probes
did not overlap the primers used in the HCV/cPCR reactions.

[0259] Fig. 38 shows an autoradiograph of a Southern blot in which the HCV/cPCR products were hybridized with
the 32P-labeled probes. The HCV/cPCR product extended from primers ag30a16A and K91Env16B (lane 1) was
approximately 1.1 kb; no other PCR products were observed in a 15 hour exposure. The HCV products extended from
the primer sets ag30a16A/CA156e16B (lane 2) and CA156e16A/K91Env16B (lane 3) were approximately 625 NT and
approximately 700 NT, respectively. The sizes of the PCR products were determined by comparison with the relative
migrations of fragments resulting from the digestion of pBR322 with MspI and of PhiX 174 digested with HaeIII (lane 5).
[0260] The above study will detect insertions or deletions as small as approximately 20 NT to 50 NT and DNA
rearrangements altering the size of the target DNA. The results in Fig. 38 confirm that there is only 1 major species of
cDNA derived from the E/M region of the HCV in the chimpanzee serum.
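
The size comparison described above can be sketched as a simple check of observed against expected product lengths. This is a minimal illustration: the 50 NT threshold is an assumption taken from the upper end of the stated resolution, "approximately 1.1 kb" is entered as 1100 NT, and the function name `size_shifts` is hypothetical.

```python
# Expected sizes (NT) from paragraph [0258]; observed sizes from paragraph [0259].
EXPECTED_NT = {
    "ag30a16A/K91Env16B": 1145,
    "ag30a16A/CA156e16B": 615,
    "CA156e16A/K91Env16B": 683,
}
OBSERVED_NT = {
    "ag30a16A/K91Env16B": 1100,  # "approximately 1.1 kb" (assumed value)
    "ag30a16A/CA156e16B": 625,
    "CA156e16A/K91Env16B": 700,
}
RESOLUTION_NT = 50  # assumed threshold: upper end of the 20-50 NT resolution


def size_shifts(expected, observed, resolution):
    """Return {primer pair: (observed - expected, exceeds resolution?)}."""
    report = {}
    for pair, expected_size in expected.items():
        delta = observed[pair] - expected_size
        report[pair] = (delta, abs(delta) > resolution)
    return report


report = size_shifts(EXPECTED_NT, OBSERVED_NT, RESOLUTION_NT)
for pair, (delta, flagged) in report.items():
    print(f"{pair}: {delta:+d} NT, possible size variant: {flagged}")
```

With these inputs no primer pair is flagged, which is in line with the single major species reported for Fig. 38.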

Amplification for Cloning of HCV cDNA Sequences Utilizing the PCR and Primers Derived from Conserved Regions 
of Flavivirus Genomic Sequences 

[0261] Our discovery that HCV is a flavi-like virus allows a strategy for cloning uncharacterized HCV cDNA sequences
utilizing the PCR technique, and primers derived from the regions encoding conserved amino acid sequences in
flaviviruses. Generally, one of the primers is derived from a defined HCV genomic sequence, and the other primer
which flanks a region of unsequenced HCV polynucleotide is derived from a conserved region of the flavivirus genome.
The flavivirus genomes are known to contain conserved sequences within the NS1 and E polypeptides, which are
encoded in the 5'-region of the flavivirus genome. Thus, to isolate cDNA sequences derived from putatively comparable
regions of the HCV genome, upstream primers are designed which are derived from the conserved sequences within
these flavivirus polypeptides. The downstream primers are derived from an upstream end of the known portion of the
HCV cDNA.

[0262] Because of the degeneracy of the code, it is probable that there will be mismatches between the flavivirus
probes and the corresponding HCV genomic sequence. Therefore a strategy which is similar to the one described by
Lee (1988) is used. The Lee procedure utilizes mixed oligonucleotide primers complementary to the reverse translation
products of an amino acid sequence; the sequences in the mixed primers take into account every codon degeneracy
for the conserved amino acid sequence.

[0263] Three sets of primer mixes are generated, based on the amino acid homologies found in several flaviviruses,
including Dengue-2,4 (D-2,4), Japanese Encephalitis Virus (JEV), Yellow Fever (YF), and West Nile Virus (WN). The
primer mixture derived from the most upstream conserved sequence (5'-1) is based upon the amino acid sequence
gly-trp-gly, which is part of the conserved sequence asp-arg-gly-trp-gly-aspN found in the E protein of D-2, JEV, YF,
and WN. The next primer mixture (5'-2) is based upon a downstream conserved sequence in E protein, phe-asp-gly-
asp-ser-tyr-ileu-phe-gly-asp-ser-tyr-ileu, and is derived from phe-gly-asp; the conserved sequence is present in D-2,
JEV, YF, and WN. The third primer mixture (5'-3) is based on the amino acid sequence arg-ser-cys, which is part of
the conserved sequence cys-cys-arg-ser-cys in the NS1 protein of D-2, D-4, JEV, YF, and WN. The individual primers
which form the mixture in 5'-3 are shown in Fig. 53. In addition to the varied sequences derived from the conserved region,
each primer in each mixture also contains a constant region at the 5'-end which contains a sequence encoding sites
for restriction enzymes, HindIII, MboI, and EcoRI.
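
To show why such primer mixtures are needed, the sketch below enumerates every coding sequence for the three tripeptides named above using the standard genetic code. It is an illustration only: it lists 9-nucleotide coding-strand sequences, omits the constant 5' restriction-site region, and does not reproduce the actual primers of Fig. 53. The function name `reverse_translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, restricted to the amino acids in GWG, FGD and RSC.
CODONS = {
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "W": ["TGG"],
    "F": ["TTT", "TTC"],
    "D": ["GAT", "GAC"],
    "R": ["CGT", "CGC", "CGA", "CGG", "AGA", "AGG"],
    "S": ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"],
    "C": ["TGT", "TGC"],
}


def reverse_translate(peptide):
    """List every nucleotide sequence that encodes the given peptide."""
    codon_choices = [CODONS[aa] for aa in peptide]
    return ["".join(combo) for combo in product(*codon_choices)]


for name, peptide in [("5'-1", "GWG"), ("5'-2", "FGD"), ("5'-3", "RSC")]:
    variants = reverse_translate(peptide)
    print(name, peptide, len(variants), "sequences of", len(variants[0]), "nt")
```

The counts (16, 16, and 72) follow directly from the codon degeneracy of each residue, and each variable region is 9 nucleotides long.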

[0264] The downstream primer, ssc5h20A, is derived from a nucleotide sequence in clone 5h, which contains HCV
cDNA with sequences which overlap those in clones 14i and 11b. The sequence of ssc5h20A is


5' GTA ATA TGG TGA CAG AGT CA 3'.

An alternative primer, ssc5h34A, may also be used. This primer is derived from a sequence in clone 5h, and in addition
contains nucleotides at the 5'-end which create a restriction enzyme site, thus facilitating cloning. The sequence of
ssc5h34A is

5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A 3'.


[0265] The PCR reaction, which was initially described by Saiki et al. (1986), is carried out essentially as described
in Lee et al. (1988), except that the template for the cDNA is RNA isolated from HCV infected chimpanzee liver, or
from viral particles isolated from HCV infected chimpanzee serum. In addition, the annealing conditions are less
stringent in the first round of amplification (0.6 M NaCl, and 25°C), since the part of the primer which will anneal to the HCV
sequence is only 9 nucleotides, and there could be mismatches. Moreover, if ssc5h34A is used, the additional
sequences not derived from the HCV genome tend to destabilize the primer-template hybrid. After the first round of
amplification, the annealing conditions can be more stringent (0.066 M NaCl, and 32°C-37°C), since the amplified
sequences now contain regions which are complementary to, or duplicates of, the primers. In addition, the first 10 cycles
of amplification are run with Klenow enzyme I, under appropriate PCR conditions for that enzyme. After the completion
of these cycles, the samples are extracted, and run with Taq polymerase, according to kit directions, as furnished by
Cetus/Perkin-Elmer.

[0266] After the amplification, the amplified HCV cDNA sequences are detected by hybridization using a probe
derived from clone 5h. This probe is derived from sequences upstream of those used to derive the primer, and does not
overlap the sequences of the clone 5h derived primers. The sequence of the probe is

5' CCC AGC GGC GTA CGC GCT GGA CAC GGA GGT GGC CGC GTC
GTG TGG CGG TGT TGT TCT CGT CGG GTT GAT GGC GC 3'.

Industrial Applicability 

[0267] The methods described herein, as well as the oligomer primers, derived from HCV cDNA, and kits containing
them, are useful for the accurate, relatively simple, and economic determination of the presence of HCV in biological
samples, more particularly in blood which may be used for transfusions, and in individuals suspected of having an
HCV infection. Moreover, these methods and oligomers may be useful for detecting an earlier stage of HCV infection
than are immunological assays based upon the use of recombinant HCV polypeptides. Also, an amplified
polynucleotide hybridization assay detects HCV RNA in occasional samples which are anti-HCV antibody negative. Thus,
the probes and primers described herein may be used in amplified hybridization assays, in conjunction with
immunoassays based on HCV polypeptides, to more completely identify infections due to HCV, and HCV-infected biological
specimens, including blood.

[0268] The information provided herein allows the design of primers which are derived from conserved regions of 
the HCV genome. The provision of these primers makes available a general method which will detect variant HCV 
strains, and which will be of use in the screening of blood and blood products. 
[0269] If the primers used in the method are derived from conserved regions of the HCV genome, the method should
aid in the detection and/or identification of variant strains of HCV. This, in turn, should lead to the development of
additional immunological reagents for the detection and diagnosis of HCV, as well as the development of additional
polynucleotide reagents for detection and/or treatment of HCV.
[0270] In addition, sets of primers and probes designed from the conserved amino acid sequences of Flaviviruses
and HCV allow for a universal detection method for these infectious agents.

[0271] The following listed materials are on deposit under the terms of the Budapest Treaty with the American Type
Culture Collection (ATCC), 12301 Parklawn Dr., Rockville, Maryland 20852, and have been assigned the following
Accession Numbers.

lambda-gt11         ATCC No.    Deposit Date

HCV cDNA library    40394       1 Dec. 1987
clone 81            40388       17 Nov. 1987
clone 91            40389       17 Nov. 1987
clone 1-2           40390       17 Nov. 1987
clone 5-1-1         40391       18 Nov. 1987
clone 12f           40514       10 Nov. 1988
clone 35f           40511       10 Nov. 1988
clone 15e           40513       10 Nov. 1988
clone K9-1          40512       10 Nov. 1988
JSC 308             20879       5 May 1988
pS356               67683       29 April 1988


In addition, the following deposits were made on 11 May 1989.

Strain                  Linkers    ATCC No.

D1210(Cf1/5-1-1)        EF         67967
D1210(Cf1/81)           EF         67968
D1210(Cf1/CA74a)        EF         67969
D1210(Cf1/35f)          AB         67970
D1210(Cf1/279a)         EF         67971
D1210(Cf1/C36)          CD         67972
D1210(Cf1/13i)          AB         67973
D1210(Cf1/C33b)         EF         67974
D1210(Cf1/CA290a)       AB         67975
HB101 (AB24/C100#3R)               67976


The following derivatives of strain D1210 were deposited on 3 May 1989.

Strain Derivative    ATCC No.

pCF1CS/C8f           67956
pCF1AB/C12f          67952
pCF1EF/14c           67949
pCF1EF/15e           67954
pCF1AB/C25c          67958
pCF1EF/C33c          67953
pCF1EF/C33f          67050
pCF1CD/33g           67951
pCF1CD/C39c          67955
pCF1EF/C40b          67957
pCF1EF/CA167b        67959


The following strains were deposited on May 12, 1989.

Strain                  ATCC No.

Lambda gt11(C35)        40603
Lambda gt10(beta-5a)    40602
D1210(C40b)             67980
D1210(M16)              67981


The following biological materials were deposited on March 23, 1990.

Material                  ATCC No.

5'-clone32 (in pUC18S)    68276



Claims 

1. A method for the detection of variant hepatitis C virus (HCV) strains in a sample which method comprises: 

a) subjecting the sample to a polymerase chain reaction (PCR), using primers of sufficient length to prime the 
synthesis of extension products in the presence of an agent for polymerization wherein at least one primer is 
capable of selectively hybridizing to a conserved region of an HCV genome as shown in Figure 18 or comple- 
ment thereof, 

and 

b) analysing the size or sequence of any amplified product. 

2. A method according to claim 1 wherein said conserved region(s) of HCV genome is NS3, or NS5. 

3. A method according to claim 1 wherein said conserved region of the HCV genome is from nucleotide -318 to 174. 

4. A method according to claim 1, 2 or 3 wherein at least one primer is capable of hybridizing to a conserved region
of a flavivirus genome.

5. A method according to claim 4 wherein said conserved region of the flavivirus genome is within the sequence
coding for NS1 or E polypeptides.

6. A method according to any one of claims 2 to 5 wherein at least one primer comprises (i) a nucleotide sequence
which encodes the polypeptide sequence TATPPG or QRRGR or (ii) the complement of the nucleotide sequence
of (i).

7. A method according to claim 6 wherein the primer comprises the sequence:
5' ACC GCC ACC CCX CC 3' where X is any nucleotide; or

5' CCT GCC CCG ACG XTG 3' where X is T or C.

8. A method according to claim 4 or 5 wherein at least one primer comprises (i) a nucleotide sequence which encodes
the polypeptide sequence GWG, FGD or RSC or (ii) the complement of the nucleotide sequence of (i).

9. A method according to claim 8 wherein the primer is an oligonucleotide primer mixture comprising the individual 
sequences shown in Figure 53. 


10. A method according to claim 1 wherein at least one primer is selected from:
5' CTC TAT GGC AAT GAG G 3';

5' CGT TGG CAT AAC TGA T 3';

5' CGA CAA GAA AGA CAG A 3';

5' AGC TTC GAC GTC ACA T 3';

5' TGA ACT ATG CAA CAG G 3';

5' GGA GTG TGC AGG ATG G 3';

5' AAG GTT GCA ATT GCT C 3';

5' ACT AAC AGG ACC TTC G 3';

5' GTA ATA TGG TGA CAG AGT CA 3' and

5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A 3'.

11. A polynucleotide primer selected from:

5' CTC TAT GGC AAT GAG G 3';
5' CGT TGG CAT AAC TGA T 3';
5' CGA CAA GAA AGA CAG A 3';
5' AGC TTC GAC GTC ACA T 3';
5' TGA ACT ATG CAA CAG G 3';
5' GGA GTG TGC AGG ATG G 3';
5' AAG GTT GCA ATT GCT C 3'; and
5' ACT AAC AGG ACC TTC G 3'.

Patentanspriiche 

1. Verfahren zum Nachweis von verschiedenen Hepatitis-C-Virus (HCV)-Stammen in einer Probe, wobei das Ver- 
fahren die folgenden Schritte umfasst: 

(a) Einsetzen der Probe in einer Polymerase-Kettenreaktion (PCR) unter Verwendung von Primern von ge- 
eigneter Lange, urn die Synthese von Verlangerungsprodukten in Anwesenheit eines Polymerisierungsagens 

zu starten, wobei wenigstens ein Primer in der Lage ist, selektiv an eine konservierte Region eines wie in Figur 

18 dargestellten HCV^Genomsoter^m Komplement davbrTzu"hybridisieren;^nd~ ~ 

(b) Analysieren der GroRe oder Sequenz jedes amplifizierten Produkts. 

2. Verfahren nach Anspruch 1, wobei die konservierte(n) Region(en) des HCV-Genoms NS3 Oder NS5 ist (sind). 

3. Verfahren nach Anspruch 1, wobei die konservierte Region des HCV-Genoms zwischen Nucleotid -318 bis 174 
liegt. 

4. Verfahren nach einem der Anspruche 1 , 2 oder 3, wobei wenigstens ein Primer in der Lage ist, an eine konservierte 
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Region eines Flavivirusgenoms zu hybridisieren. 

5. Verfahren nach Anspnjch 4, wobei die konservierte Region des Flavivirusgenoms sich in der Sequenz befindet, 
welche NS1- oder E-Polypeptide codiert. 

6. Verfahren nach einem der Anspruche 2 bis 5, wobei wenigstens ein Primer (i) eine Nucleotidsequenz, welche die 
Polypeptidsequenz TATPPG Oder QRRGR codiert, oder (ii) den Komplementarstrang der Nucleotidsequenz aus 
(i) umfasst. 

7. Verfahren nach Anspruch 6, wobei der Primer die Sequenz 

5' ACC GCC ACC CCX CC3* umfasst, wobei X irgendein Nucieotid ist; oder 
5' CCT GCC CCG ACGXTG3* umfasst, wobei X T oder C ist 

8. Verfahren nach Anspruch 4 oder 5, wobei wenigstens ein Primer (i) eine Nucleotidsequenz, welche die Polypep- 
tidsequenz GWG, FGD oder RSC codiert, oder (ii) den Komplementarstrang der Nucleotidsequenz von (i) umfasst. 

9. Verfahren nach Anspruch 8, wobei der Primer eine Oligonucleotidprimer-Mischung ist, umfassend die einzelnen 
in Figur 53 dargestellten Sequenzen. 

10. Verfahren nach Anspruch 1, wobei wenigstens ein Primer ausgewählt ist aus:

5' CTC TAT GGC AAT GAG G3';
5' CGT TGG CAT AAC TGA T3';
5' CGA CAA GAA AGA CAG A3';
5' AGC TTC GAC GTC ACA T3';
5' TGA ACT ATG CAA CAG G3';
5' GGA GTG TGC AGG ATG G3';
5' AAG GTT GCA ATT GCT C3';
5' ACT AAC AGG ACC TTC G3';
5' GTA ATA TGG TGA CAG AGT CA3' und
5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A3'.

11. Primer ausgewählt aus:

5' CTC TAT GGC AAT GAG G3'; 
5' CGT TGG CAT AAC TGA T3'; 
5' CGA CAA GAA AGA CAG A3'; 
5' AGC TTC GAC GTC ACA T3'; 
5' TGA ACT ATG CAA CAG G3'; 
5' GGA GTG TGC AGG ATG G3'; 
5' AAG GTT GCA ATT GCT C3'; und 
5' ACT AAC AGG ACC TTC G3'. 
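
Claims 6 and 7 use a degenerate primer notation in which the position X stands for more than one nucleotide. As a minimal sketch (not part of the claimed method), the following Python snippet enumerates the concrete oligonucleotides that such a notation covers. The helper name `expand_degenerate` and the way the permitted bases for X are passed in are assumptions made purely for illustration.

```python
from itertools import product


def expand_degenerate(primer, x_bases):
    """Return every concrete sequence obtained by replacing each X in
    `primer` with one of the bases in `x_bases` (hypothetical helper)."""
    seq = primer.replace(" ", "").upper()
    # Each position offers either one fixed base or the set allowed for X.
    slots = [x_bases if base == "X" else base for base in seq]
    return ["".join(combo) for combo in product(*slots)]


# Claim 7, first primer: X is any nucleotide.
tatppg = expand_degenerate("ACC GCC ACC CCX CC", "ACGT")
# Claim 7, second primer: X is T or C.
qrrgr = expand_degenerate("CCT GCC CCG ACG XTG", "TC")

print(tatppg)
print(qrrgr)
```

The first primer expands to four oligonucleotides and the second to two, so a primer mixture built from this notation would contain that many distinct species per degenerate position.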
Revendications

1. Procédé pour détecter des souches variantes du virus de l'hépatite C (HCV) dans un échantillon, lequel procédé
comporte les étapes consistant à :

a) soumettre les échantillons à une réaction en chaîne de la polymérase (PCR), en utilisant des amorces ayant
une longueur suffisante pour amorcer la synthèse de produits d'extension en présence d'un agent de polymérisation,
au moins une amorce étant capable de s'hybrider de manière sélective à une région conservée
d'un génome HCV comme représenté sur la figure 18 ou d'un complément de celui-ci, et
b) analyser la taille ou la séquence de tout produit amplifié.

2. Procédé selon la revendication 1, dans lequel ladite ou lesdites régions conservées du génome HCV sont NS3
ou NS5.

3. Procédé selon la revendication 1, dans lequel ladite région conservée du génome HCV est comprise entre les
nucléotides -318 et 174.


4. Procédé selon la revendication 1, 2 ou 3, dans lequel au moins une amorce est capable de s'hybrider à une région
conservée d'un génome de flavivirus.

5. Procédé selon la revendication 4, dans lequel ladite région conservée du génome de flavivirus se trouve dans la
séquence codant pour des polypeptides NS1 ou E.

6. Procédé selon l'une quelconque des revendications 2 à 5, dans lequel au moins une amorce comporte (i) une
séquence nucléotidique qui code pour la séquence polypeptidique TATPPG ou QRRGR ou (ii) le complément de
la séquence nucléotidique de (i).

7. Procédé selon la revendication 6, dans lequel l'amorce comporte la séquence suivante :

5' ACC GCC ACC CCX CC3' où X est un quelconque nucléotide, ou
5' CCT GCC CCG ACGXTG3' où X est T ou C.

8. Procédé selon la revendication 4 ou 5, dans lequel au moins une amorce comporte (i) une séquence nucléotidique
qui code pour la séquence polypeptidique GWG, FGD ou RSC, ou (ii) le complément de la séquence nucléotidique
de (i).

9. Procédé selon la revendication 8, dans lequel l'amorce est un mélange d'amorces oligonucléotidiques comportant
les séquences individuelles représentées sur la figure 53.

10. Procédé selon la revendication 1, dans lequel au moins une amorce est sélectionnée parmi :

5' CTC TAT GGC AAT GAG G3',
5' CGT TGG CAT AAC TGA T3',
5' CGA CAA GAA AGA CAG A3',
5' AGC TTC GAC GTC ACA T3',
5' TGA ACT ATG CAA CAG G3',
5' GGA GTG TGC AGG ATG G3',
5' AAG GTT GCA ATT GCT C3',
5' ACT AAC AGG ACC TTC G3',
5' GTA ATA TGG TGA CAG AGT CA3', et
5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A3'.


11. Amorce polypeptidique sélectionnée parmi :

5' CTC TAT GGC AAT GAG G3',
5' CGT TGG CAT AAC TGA T3',
5' CGA CAA GAA AGA CAG A3',
5' AGC TTC GAC GTC ACA T3',
5' TGA ACT ATG CAA CAG G3',
5' GGA GTG TGC AGG ATG G3',
5' AAG GTT GCA ATT GCT C3', et
5' ACT AAC AGG ACC TTC G3'.
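
The primers listed in claims 10 and 11 are stated to hybridize to conserved regions of the HCV genome shown in Figure 18. As an illustrative sketch only, the snippet below shows how one might locate a primer on a stretch of the top strand and report its position in the figure's numbering. The helper `locate_primer` is hypothetical, and it checks exact forward-strand matches only; it does not model hybridization conditions or reverse-strand primers.

```python
def locate_primer(primer, template, first_position=1):
    """Return the figure coordinate of the first exact match of `primer`
    in `template`, or None if there is no match (hypothetical helper)."""
    hit = template.find(primer.replace(" ", ""))
    return None if hit < 0 else first_position + hit


# Top-strand line numbered 241 in FIG. 18, copied from this document.
line_241 = "TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC"

position = locate_primer("CTC TAT GGC AAT GAG G", line_241, first_position=241)
print(position)
```

A fuller check would also search the reverse complement of the template, since some of the listed primers may be written for the opposite strand.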
FIG. 1 Translation of DNA 12f

IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsn
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA

LeuLeuThrThrThrGlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeu
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAG
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGG 

SerThrGlyLeuIleHisLeuHisGlnAsnIleValAspValGlnTyrLeuTyrGlyVal
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTCTAACACCTGCACGTCATGAACATGCCCC 


GlySerSerIleAlaSerTrpAlaIleLysTrpGluTyrValValLeuLeuPheLeuLeu
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 


LeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGlu 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

Overlap with 14i 

AlaAlaLeuGluAsnLeuValIleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTCCCAG 


Val 
421 TTGTATC 
AACATAG 
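
Each figure of this kind shows three rows per block: the deduced amino acids, the numbered top strand, and the complementary strand beneath it. The following sketch, offered under stated assumptions rather than as the applicants' procedure, checks the first block of FIG. 1 for internal consistency. It assumes the standard genetic code and a reading frame that starts at the third nucleotide of the line; the names `complement` and `translate` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def complement(seq):
    """Base-by-base complement, written in the same left-to-right order."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))


def translate(seq, offset=0):
    """One-letter translation of complete codons starting at `offset`."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First block of FIG. 1 (DNA 12f), copied from this document.
top = "CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA"
bottom = "GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT"

strands_agree = complement(top) == bottom
peptide = translate(top, offset=2)
print(strands_agree, peptide)
```

The one-letter peptide corresponds to IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCys; the final Asn of the printed row spans the line break and is completed by the next block.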
FIG. 2-1 Translation of DNA k9-1

GlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGly
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlyIleValProAlaLysSerValCysGlyProValTyrCysPheThr
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 


TyrProTyrArgLeuTrpHisTyrProCysThrIleAsnTyrThrIlePheLysIleArg
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACTATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGATATAAATTTTAGT 


MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAGCACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 


ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr
601 AACGTTGCGATCTGGAAGATAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA
TTGCAACGCTAGACCTTCTATCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 


GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 
Overlap with Combined ORF of DNAs 12f through 15e

HisLeuHisGlnAsnIleValAspValGlnTyrLeuTyrGlyValGlySerSerIleAla
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 


SerTrpAlaIleLysTrpGluTyrValValLeuLeuPheLeuLeuLeuAlaAspAlaArg
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAGGAGGACAAGGAAGACGAACGTCTGCGCG 


ValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsn
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAAGCGGCTTTGGAGA
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTTCGCCGAAACCTCT 


LeuValIleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 


PhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPhe
961 TGTTCTTCTGCTTTGCATGGTATCTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAGACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 

TyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeu
1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspThrGluValAlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThr 
1081 TGGACACGGAGGTCGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTAA
ACCTGTGCCTCCAGCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGATT


LeuSerProTyrTyrLysArgTyrIleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeu
1141 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 

ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyArg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC
ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 


AspAlaValIleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys
1261 GCGACGCTGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCA
CGCTGCGACAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 


LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAla 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTC 


FIG. 2-2 




FIG. 3 Translation of DNA 15e 


GlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThrThrProLeuAlaArgAla 
1 CGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGC 
GCCGCGACCTTTCTCCCAGATGATCGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCG 

Overlap with 26g 

AlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGlyAsnIleIleMetPhe
61 TGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTT
ACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAA 


AlaProThrLeuTrpAlaArgMetIleLeuMetThrHisPhePheSerValLeuIleAla
121 TGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGC
ACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCG 

ArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerIleGlu
181 CAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGA
GTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCT 

ProLeuAspLeuProProIleIleGlnArgLeu
241 ACCACTTGATCTACCTCCAATCATTCAAAGACTC
TGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
FIG. 4

Translation of DNA 13i

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly
1 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCTACCTACAGCTGGG
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGATGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
61 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
121 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
181 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpLeuThrProArgCysLeuValAsp 
241 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGCTCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCGAGTGTGGGTCCACGGACCAGC 


TyrProTyrArgLeuTrpHisTyrProCysThrIleAsnTyrThrIlePheLysIleArg
301 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 


MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu
361 GGATGTACGTGGGAGGGGTCGAGCACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG
CCTACATGCACCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC

Overlap with 12f

ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr
421 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 


GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeu 
481 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 
FIG. 5 Translation of DNA 26j

LeuPheTyrHisHisLysPheAsnSerSerGlyCysProGluArgLeuAlaSerCysArg
1 GCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAGAGGCTAGCCAGCTGCCG
CGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACTCTCCGATCGGTCGACGGC 

ProLeuThrAspPheAspGlnGlyTrpGlyProIleSerTyrAlaAsnGlySerGlyPro 
61 ACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTATGCCAACGGAAGCGGCCC
TGGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAATACGGTTGCCTTCGCCGGG 

AspGlnArgProTyrCysTrpHisTyrProProLysProCysGlyIleValProAlaLys
121 CGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGCGGTATTGTGCCCGCGAA
GCTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGCTT

Overlap with 13i

SerValCysGlyProValTyrCysPheThrProSerProValValVal 
181 GAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGG 
CTCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCACCACCC 
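
The "Overlap with ..." annotations mark where the end of one clone matches the start of the next, which is how the individual cDNAs are joined into the composite sequence of FIG. 18. As a rough sketch of that idea, the snippet below finds the longest suffix of one sequence that is identical to a prefix of another. The function name and the `min_len` threshold are assumptions for illustration, and exact matching is used even though the figures note nucleotide heterogeneity between clones.

```python
def suffix_prefix_overlap(left, right, min_len=10):
    """Return the longest suffix of `left` that equals a prefix of `right`,
    or an empty string if no overlap of at least `min_len` exists."""
    for size in range(min(len(left), len(right)), min_len - 1, -1):
        if left.endswith(right[:size]):
            return right[:size]
    return ""


# Last top-strand line of DNA 26j (FIG. 5) and first line of DNA 13i (FIG. 4),
# both copied from this document.
end_of_26j = "GAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGG"
start_of_13i = "CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCTACCTACAGCTGGG"

shared = suffix_prefix_overlap(end_of_26j, start_of_13i)
print(shared, len(shared))
```

A tolerant variant that allows a small number of mismatches would be closer to how overlapping clones with sequence heterogeneity are aligned in practice.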


FIG. 6 Translation of DNA CA59a

LeuValMetAlaGlnLeuLeuArgIleProGlnAlaIleLeuAspMetIleAlaGlyAla
1 TTGGTAATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCT 
AACCATTACCGAGTCGACGAGGCCTAGGGTGTTCGGTAGAACCTGTACTAGCGACCACGA 

HisTrpGlyValLeuAlaGlyIleAlaTyrPheSerMetValGlyAsnTrpAlaLysVal
61 CACTGGGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTC
GTGACCCCTCAGGACCGCCCGTATCGCATAAAGAGGTACCACCCCTTGACCCGCTTCCAG 

LeuValValLeuLeuLeuPheAlaGlyValAspAlaGluThrHisValThrGlyGlySer 
121 CTGGTAGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGT 
GACCATCACGACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCA 

AlaGlyHisThrValSerGlyPheValSerLeuLeuAlaProGlyAlaLysGlnAsnVal 
181 GCCGGCCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTC
CGGCCGGTGTGACACAGACCTAAACAATCGGAGGAGCGTCGTCCGCGGTTCGTCTTGCAG 

GlnLeuIleAsnThrAsnGlySerTrpHisLeuAsnSerThrAlaLeuAsnCysAsnAsp
241 CAGCTGATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGAT 
GTCGACTAGTTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTA 


SerLeuAsnThrGlyTrpLeuAlaGlyLeuPheTyrHisHisLysPheAsnSerSerGly
301 AGCCTCAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGC 
TCGGAGTTGTGGCCGACCAACCGTCCCGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCG 

Overlap with 26j


Overlap with k9-1

CysProGluArgLeuAlaSerCysArgPro
361 TGTCCTGAGAGGCTAGCCAGCTGCCGACCCC
ACAGGACTCTCCGATCGGTCGACGGCTGGGG 
FIG. 7 Translation of DNA CA84a

GlnGlyCysAsnCysSerIleTyrProGlyHisIleThrGlyHisArgMetAlaTrpAsp
1 CGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGGG
GCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACCC 


MetMetMetAsnTrpSerProThrThrAlaLeuValMetAlaGlnLeuLeuArgIlePro
61 ATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATCC
TATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAGG 


GlnAlaIleLeuAspMetIleAlaGlyAlaHisTrpGlyValLeuAlaGlyIleAlaTyr
121 CACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCGT
GTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGCA 

Overlap with CA59a 

PheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGlyVal
181 ATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGCG 
TAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCGC 


AspAlaGluThrHisValThrGly
241 TCGACGCGGAAACCCACGTCACCGGGG 
AGCTGCGCCTTTGGGTGCAGTGGCCCC 


FIG. 8 Translation of DNA CA156e

CysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThrGln
1 GTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCA 
CACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGT 

LeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyrVal 
61 GCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGT
CGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGGGAGATGCA 

GlyAspLeuCysGlySerValPheLeuValGlyGlnLeuPheThrPheSerProArgArg 
121 GGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCG 
CCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGC 


HisTrpThrThrGlnGlyCysAsnCysSerIleTyrProGlyHisIleThrGlyHisArg
181 CCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCG
GGTGACCTGCTGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGC 

Overlap with CA84a 

MetAlaTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValAlaGlnLeu
241 CATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAGTGGCTCAGCT
GTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATCACCGAGTCGA


LeuArgIleProGlnAla
301 GCTCCGGATCCCACAAGCC
CGAGGCCTAGGGTGTTCGG
FIG. 9


Translation of DNA CA167b

SerThrGlyLeuTyrHisValThrAsnAspCysProAsnSerSerIleValTyrGluAla
1 CTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGC
GAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCG 

AlaAspAlaIleLeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSer
61 GGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAACGCCTC
CCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAG 


ArgCysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThr
121 GAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGAC 
CTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTG 

Overlap with CA156e 

GlnLeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyr
181 GCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCTACCCTCTGTTCGGCCCTCTA 
CGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGATGGGAGACAAGCCGGGAGAT 


ValGlyAspLeuCysGlySerValPheLeu
241 CGTGGGGGACTTGTGCGGGTCTGTCTTTCTTG
GCACCCCCTGAACACGCCCAGACAGAAAGAAC 
FIG. 10

Translation of DNA ssCA216a

ArgArgArgSerArgAsnLeuGlyLysValIleAspThrLeuThrCysGlyPheAlaAsp
1 CCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCG
GGGCCGCATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGC 

LeuMetGlyTyrIleProLeuValGlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAla
61 ACCTCATGGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGG
TGGAGTACCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACC

HisGlyValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCys
121 CGCATGGCGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTT 
GCGTACCGCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAA 

SerPheSerIlePheLeuLeuAlaLeuLeuSerCysLeuThrValProAlaSerAlaTyr
181 GCTCTTTCTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCT
CGAGAAAGAGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGA


GlnValArgAsnSerThrGlyLeuTyrHisValThrAsnAspCysProAsnSerSerIle
241 ACCAAGTGCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTA
TGGTTCACGCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCAT

overlap with CA167b

ValTyrGluAlaAlaAspAlaIleLeuHisThrProGlyCysValProCysValArgGlu
301 TTGTGTACGAAGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTG
AACACATGCTTCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCAC


GlyAsnAlaSerArgCysTrpValAlaMetThrProThrValAla 
361 AGGGCAACGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCC 
TCCCGTTGCGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGG 
FIG. 11

Translation of DNA ssCA290a

LysLysAsnLysArgAsnThrAsnArgArgProGlnAspValLysPheProGlyGlyGly
1 AAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCG
TTTTTTTTTTGTTTGCATTGTGGTTGGC 

GlnIleValGlyGlyValTyrLeuLeuProArgArgGlyProArgLeuGlyValArgAla
61 GTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCG
CAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGC

ThrArgLysThrSerGluArgSerGlnProArgGlyArgArgGlnProIleProLysAla 
121 CGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGTAGACGCCAGCCTATCCCCAAGG 
GCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCATCTGCGGTCGGATAGGGGTTCC 

ArgArgProGluGlyArgThrTrpAlaGlnProGlyTyrProTrpProLeuTyrGlyAsn 
181 CTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCA
GAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGT 

GluGlyCysGlyTrpAlaGlyTrpLeuLeuSerProArgGlySerArgProSerTrpGly 
241 ATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGG
TACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCC 


ProThrAspProArgArgArgSerArgAsnLeuGlyLysValIleAspThrLeuThrCys
301 GCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGT 
CGGGGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCA 


GlyPheAlaAspLeuMetGlyTyrIleProLeuValGlyAlaProLeuGlyGlyAlaAla
361 GCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTG 
CGCCGAAGCGGCTGGAGTACCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGAC 

overlap with CA216a 

ArgAlaLeuAlaHisGlyValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsn 
421 CCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGA 
GGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCT 


LeuProGlyCysSerPheSerThrPhe 
481 ACCTTCCTGGTTGCTCTTTCTCTACCTTC 
TGGAAGGACCAACGAGAAAGAGATGGAAG 
FIG. 12-1 Translation of DNA ag30a

#MetSerValValGlnProProGlyProProLeu
#MetAlaLeuValOP

1 CGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCC
GCGTCTTTCGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGG 

ProGlyGluProAM 

61 TCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGAC
AGGGCCCTCTCGGTATCACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTG 

#MetProGlyAspLeuGlyValProProGlnAsp 

121 CGGGTCCTTTCTTGGATCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGA 
GCCCAGGAAAGAACCTAGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCT

OP AM GlyAlaCys 
CysAM * 

181 CTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTT 
GACGATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAA 

I 

GluCysProGlyArgSerArgArgProCysThrMetSerThrAsnProLysProGlnLys 

241 GCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAA 
CGCTCACGGGGCCCTCCAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATTTGGAGTCT 

LysAsnLysArgAsnThrAsnArgArgProGlnAspValLysPheProGlyGlyGlyGln 

301 AAAAAAACAAACGTAACACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTC 
TTTTTTTGTTTGCATTGTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAG 

IleValGlyGlyValTyrLeuLeuProArgArgGlyProArgLeuGlyValArgAlaThr 

361 AGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGA 
TCTAGCAACCACCTCAAATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCT 

ArgLysThrSerGluArgSerGlnProArgGlyArgArgGlnProIleProLysAlaArg 

421 CGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTC 
GCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAG 

ArgProGluGlyArgThrTrpAlaGlnProGlyTyrProTrpProLeuTyrGlyAsnGlu 

overlap with CA290a 

481 GTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATG 
CAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTAC


GlyCysGlyTrpAlaGlyTrpLeuLeuSerProArgGlySerArgProSerTrpGlyPro 


541 AGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCC
TCCCGACGCCCACCCGCCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCCCGG 

ThrAspProArgArgArgSerArgAsnLeuGlyLysValIleAspThrLeuThrCysGly
601 CCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCG
GGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGC 

Phe 


661 GCTTC 
CGAAG 


* - Start of long HCV ORF 

| - Putative first amino acid of large HCV polyprotein 

# - Putative small encoded peptides (that may play a 

translational regulatory role) 


FIG. 12-2 
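
The legend above marks short peptides encoded upstream of the long HCV ORF, each running from an ATG to an in-frame stop codon (OP denotes the opal codon TGA and AM the amber codon TAG). The sketch below, which assumes the standard genetic code and uses a hypothetical helper `small_orfs`, scans a stretch of the 5' region for such ATG-to-stop reading frames. It is an illustration of the annotation, not a statement that these peptides are expressed.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def small_orfs(seq):
    """Return (start_index, peptide) for every ATG in `seq` that is
    followed in-frame by a stop codon (hypothetical helper)."""
    found = []
    start = seq.find("ATG")
    while start != -1:
        peptide = []
        for i in range(start, len(seq) - 2, 3):
            residue = CODON_TABLE[seq[i:i + 3]]
            if residue == "*":  # in-frame stop closes the ORF
                found.append((start, "".join(peptide)))
                break
            peptide.append(residue)
        start = seq.find("ATG", start + 1)
    return found


# Part of the 5' region shown in FIG. 18 (lines -319 to -199), copied
# from this document.
fragment = "GCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGG"

orfs = small_orfs(fragment)
print(orfs)
```

The two peptides found correspond to the rows MetAlaLeuValOP and MetSerValValGlnProProGlyProProLeuProGlyGluProAM shown in the figure.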
FIG. 13

Translation of DNA CA205a

ValLeuGlyArgGluArgProCysGlyThrAlaOP AM GlyAlaCysGluCysProGly
1 GTCTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGG
CAGAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCC

ArgSerArgArgProCysThrMetSerThrAsnProLysProGlnArgLysThrLysArg 
61 AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGT 
TCCAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATT 


AsnThrAsnArgArgProGlnAspValLysPheProGlyGlyGlyGlnIleValGlyGly
121 AACACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGA
TTGTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAGTC 

ValTyrLeuLeuProArgArgGlyProArgLeuGlyValArgAlaThrArgLysThrSer 
181 GTTTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCC 
CAAATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGG 

overlap with CA290a 

GluArgSerGlnProArgGlyArgArgGlnProIleProLysAlaArgArgProGluGly 
241 GAGCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGC 
CTCGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCG

ArgThrTrpAlaGlnProGlyTyrProTrpProLeuTyrGlyAsnGluGlyCys 
301 AGGACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGGAATGAGGGCTGCG 
TCCTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTACTCCCGACGC 

* - putative initiator methionine codon 
FIG. 14

Translation of DNA 18g


♦ProProOP 

ISerThrMetAsnHisSerProValArgAsnTyrCysLeuHisAlaGluSerValAM Pro
ILeuHisHisGluSerLeuProCysGluGluLeuLeuSerSerArgArgLysArgLeuAla
1 CTCCACCATGAATCACTCCCCTGTGAGGAACTACTGTCTTCACGCAGAAAGCGTCTAGCC
GAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATCGG 

IMetSerValValGlnProProGlyProProLeuProGlyGluProAM 
MetAlaLeuValOP 

61 ATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGT
TACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTATCA 


121 GGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATC 
CCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCTAG

overlap with ag30a 

IMetProGlyAspLeuGlyValProProGlnAspCysAM 

181 AACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGTGT 
TTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCACA

OP AM GlyAlaCysGluCysProGlyArgSer 

241 TGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGT
ACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTCCA

ArgArg 

301 CTCGTAGA 
GAGCATCT 

* - Start of long HCV ORF 

I - Putative small encoded peptides (that may 
play a translational regulatory role) 
FIG. 15


Translation of DNA 16jh

Overlap with 15e

GlyAlaCysTyrSerIleGluProLeuAspLeuProProIleIleGlnArgLeuHisGly
1 GGGGCCTGCTACTCCATAGAACCACTGGATCTACCTCCAATCATTCAAAGACTCCATGGC
CCCCGGACGATGAGGTATCTTGGTGACCTAGATGGAGGTTAGTAAGTTTCTGAGGTACCG

LeuSerAlaPheSerLeuHisSerTyrSerProGlyGluIleAsnArgValAlaAlaCys
61 CTCAGCGCATTTTCACTCCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGC
GAGTCGCGTAAAAGTGAGGTGTCAATGAGAGGTCCACTTTAATTATCCCACCGGCGTACG

Gly* 
G 

LeuArgLysLeuGlyValProProLeuArgAlaTrpArgHisArgAlaArgSerValArg
121 CTCAGAAAACTTGGGGTACCGCCCTTGCGAGCTTGGAGACACCGGGCCCGGAGCGTCCGC
GAGTCTTTTGAACCCCATGGCGGGAACGCTCGAACCTCTGTGGCCCGGGCCTCGCAGGCG 

AlaArgLeuLeuAlaArgGlyGlyArgAlaAlaIleCysGlyLysTyrLeuPheAsnTrp
181 GCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGG 
CGATCCGAAGACCGGTCTCCTCCGTCCCGACGGTATACACCGTTCATGGAGAAGTTGACC 

AlaValArgThrLysLeuLys 
241 GCAGTAAGAACAAAGCTCAAAC
CGTCATTCTTGTTTCGAGTTTG 


* - nucleotide heterogeneity 
FIG. 16 Translation of DNA 6k

Overlap with 16jh

GlyArgAlaAlaIleCysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLeuLys
1 GGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCTCAAA
CCGTCCCGACGGTATACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGAGTTT

LeuThrProIleAlaAlaAlaGlyGlnLeuAspLeuSerGlyTrpPheThrAlaGlyTyr
61 CTCACTCCAATAGCGGCCGCTGGCCAGCTGGACTTGTCCGGCTGGTTCACGGCTGGCTAC
GAGTGAGGTTATCGCCGGCGACCGGTCGACCTGAACAGGCCGACCAAGTGCCGACCGATG

SerGlyGlyAspIleTyrHisSerValSerHisAlaArgProArgTrpIleTrpPheCys
121 AGCGGGGGAGACATTTATCACAGCGTGTCTCATGCCCGGCCCCGCTGGATCTGGTTTTGC
TCGCCCCCTCTGTAAATAGTGTCGCACAGAGTACGGGCCGGGGCGACCTAGACCAAAACG


181 CC
GG
FIG. 18-1

-341 GCCAGCCCCCTGATGGGGGCGA
CGGTCGGGGGACTACCCCCGCT

-319 CACTCCACCATGAATCACTCCCCTGTGAGGAACTACTGTCTTCACGCAGAAAGCGTCTAG 
GTGAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATC 

-259 CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTAT

-199 GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA
CACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCT

-139 TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
AGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCA 

-79 GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG
CAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTC 

-19 GTCTCGTAGACCGTGCACC
CAGAGCATCTGGCACGTGG 

Arg Thr 

MetSerThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 
1 ATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAG
TACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATTGTGGTTGGCAGCGGGTGTC

AspValLysPheProGlyGlyGlyGlnIleValGlyGlyValTyrLeuLeuProArgArg
61 GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG
CTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCC 

GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly 
121 GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
CCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCA 

ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 
181 AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
TCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCC 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 
241 TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 
ATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGG 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsnLeuGly 
301 CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 
GCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCA 

LysValIleAspThrLeuThrCysGlyPheAlaAspLeuMetGlyTyrIleProLeuVal
361 AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
TTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTACCCCATGTATGGCGAGCAG 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp
421 GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
CCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTG 

Thr 

GlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerIlePheLeuLeuAla
481 GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTCCTTCTGGCC 
CCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAGAGATAGAAGGAAGACCGG 

LeuLeuSerCysLeuThrValProAlaSerAlaTyrGlnValArgAsnSerThrGlyLeu 
541 CTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT 
GACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCACGCGTTGAGGTGCCCCGAA 
FIG. 18-2

TyrHisValThrAsnAspCysProAsnSerSerIleValTyrGluAlaAlaAspAlaIle
601 TACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGCGGCCGATGCCATC 
ATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCGCCGGCTACGGTAG 

LeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSerArgCysTrpVal 
661 CTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAACGCCTCGAGGTGTTGGGTG
GACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAGCTCCACAACCCAC 

AlaMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThrGlnLeuArgArg
721 GCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCAGCTTCGACGT 
CGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGTCGAAGCTGCA 

HisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyrValGlyAspLeu 
781 CACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGTGGGGGACCTA 
GTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGGGAGATGCACCCCCTGGAT

CysGlySerValPheLeuValGlyGlnLeuPheThrPheSerProArgArgHisTrpThr 
841 TGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCGCCACTGGACG 
ACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGCGGTGACCTGC

ThrGlnGlyCysAsnCysSerIleTyrProGlyHisIleThrGlyHisArgMetAlaTrp
901 ACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGG 
TGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACC 

Val 

AspMetMetMetAsnTrpSerProThrThrAlaLeuValMetAlaGlnLeuLeuArgIle
961 GATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATC 
CTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAG 

ProGlnAlaIleLeuAspMetIleAlaGlyAlaHisTrpGlyValLeuAlaGlyIleAla
1021 CCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCG
GGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGC

TyrPheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGly 
1081 TATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGC 
ATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCG 

ValAspAlaGluThrHisValThrGlyGlySerAlaGlyHisThrValSerGlyPheVal 
1141 GTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCACACTGTGTCTGGATTTGTT
CAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGTGTGACACAGACCTAAACAA 

SerLeuLeuAlaProGlyAlaLysGlnAsnValGlnLeuIleAsnThrAsnGlySerTrp 
1201 AGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGATCAACACCAACGGCAGTTGG 
TCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTAGTTGTGGTTGCCGTCAACC

HisLeuAsnSerThrAlaLeuAsnCysAsnAspSerLeuAsnThrGlyTrpLeuAlaGly 
1261 CACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAACACCGGCTGGTTGGCAGGG
GTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTGTGGCCGACCAACCGTCCC 

LeuPheTyrHisHisLysPheAsnSerSerGlyCysProGluArgLeuAlaSerCysArg 
1321 CTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGA 
GAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACTCTCCGATCGGTCGACGGCT

ProLeuThrAspPheAspGlnGlyTrpGlyProIleSerTyrAlaAsnGlySerGlyPro 
1381 CCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTATGCCAACGGAAGCGGCCCC 
GGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAATACGGTTGCCTTCGCCGGGG 

AspGlnArgProTyrCysTrpHisTyrProProLysProCysGlyIleValProAlaLys
1441 GACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGCGGTATTGTGCCCGCGAAG 
CTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGCTTC 
FIG. 18-3

SerValCysGlyProValTyrCysPheThrProSerProValValValGlyThrThrAsp 
1501 AGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGGAACGACCGAC 
TCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCACCACCCTTGCTGGCTG

ArgSerGlyAlaProThrTyrSerTrpGlyGluAsnAspThrAspValPheValLeuAsn 
1561 AGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATACGGACGTCTTCGTCCTTAAC 
TCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATGCCTGCAGAAGCAGGAATTG 

AsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrpMetAsnSerThrGlyPhe
1621 AATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTGGATGAACTCAACTGGATTC
TTATGGTCCGGTCGCGACCCGTTAACCAAGCCAACATGGACCTACTTGAGTTGACCTAAG 

ThrLysValCysGlyAlaProProCysValIleGlyGlyAlaGlyAsnAsnThrLeuHis
1681 ACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGCGGGCAACAACACCCTGCAC 
TGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCGCCCGTTGTTGTGGGACGTG 

CysProThrAspCysPheArgLysHisProAspAlaThrTyrSerArgCysGlySerGly 
1741 TGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATACTCTCGGTGCGGCTCCGGT
ACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTATGAGAGCCACGCCGAGGCCA 

Ile

ProTrpLeuThrProArgCysLeuValAspTyrProTyrArgLeuTrpHisTyrProCys 
1801 CCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAGGCTTTGGCATTATCCTTGT 
GGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCCGAAACCGTAATAGGAACA

ThrIleAsnTyrThrIlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeu
1861 ACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTG
TGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGAC

GluAlaAlaCysAsnTrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSer
1921 GAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCC 
CTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGG 

GluLeuSerProLeuLeuLeuThrThrThrGlnTrpGlnValLeuProCysSerPheThr 
1981 GAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACA
CTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTCT 

ThrLeuProAlaLeuSerThrGlyLeuIleHisLeuHisGlnAsnIleValAspValGln
2041 ACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAG
TGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTC

TyrLeuTyrGlyValGlySerSerIleAlaSerTrpAlaIleLysTrpGluTyrValVal
2101 TACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTT 
ATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAA 

LeuLeuPheLeuLeuLeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeu
2161 CTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTC 
GAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAG 

IleSerGlnAlaGluAlaAlaLeuGluAsnLeuValIleLeuAsnAlaAlaSerLeuAla
2221 ATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCC 
TATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGG 

GlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrpTyrLeuLysGly 
2281 GGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGT
CCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCA

LysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeuLeuLeuLeuLeu 
2341 AAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTG
TTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGAC 
FIG. 18-4

LeuAlaLeuProGlnArgAlaTyrAlaLeuAspThrGluValAlaAlaSerCysGlyGly
2401 TTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGGCGGT
AACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCA 

ValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyrIleSer
2461 GTTGTTCTCGTCGGGTTCATGGCGCTGACTCTGTCACCATATTACAAGCGCTATATCAGC 
CAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCG 

(Asn) 

TrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrp
2521 TGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGG 
ACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACC 

IleProProLeuAsnValArgGlyGlyArgAspAlaValIleLeuLeuMetCysAlaVal
2581 ATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTA
TAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACAT

HisProThrLeuValPheAspIleThrLysLeuLeuLeuAlaValPheGlyProLeuTrp 
2641 CACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTGG 
GTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACC 

IleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArg 
2701 ATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGG
TAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCC 

PheCysAlaLeuAlaArgLysMetIleGlyGlyHisTyrValGlnMetValIleIleLys
2761 TTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATCATTAAG 
AAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTC 

LeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAla
2821 TTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGGGCG 
AATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGC 

HisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGlnMetGlu 
2881 CACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAG 
GTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTC 

ThrLysLeuIleThrTrpGlyAlaAspThrAlaAlaCysGlyAspIleIleAsnGlyLeu
2941 ACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTG 
TGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAAC 

ProValSerAlaArgArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSer 
3001 CCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCC 
GGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGG 

LysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeu 
3061 AAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTA 
TTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGAT 

GlyCysIleIleThrSerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGln
3121 GGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAG
CCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTC

IleValSerThrAlaAlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThr
3181 ATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACT 
TAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGA 

ValTyrHisGlyAlaGlyThrArgThrIleAlaSerProLysGlyProValIleGlnMet
3241 GTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATG 
CAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTAC 
FIG. 18-5

Ser Thr

TyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeu 
3301 TATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTG
ATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAAC

ThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAspValIle
3361 ACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATT 
TGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAA 

ProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyr
3421 CCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTAC 
GGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATG 

LeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyIlePhe
3481 TTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTT 
AACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAA 

ArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPheIleProValGluAsn
3541 AGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAAC
TCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTG

LeuGluThrThrMetArgSerProValPheThrAspAsnSerSerProProValValPro 
3601 CTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCC 
GATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGG 

GlnSerPheGlnValAlaHisLeuHisAlaProThrGlySerGlyLysSerThrLysVal
3661 CAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTC
GTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAG 

ProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAla
3721 CCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCA
GGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGT 

Leu 

ThrLeuGlyPheGlyAlaTyrMetSerLysAlaHisGlyIleAspProAsnIleArgThr
3781 ACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACC
TGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGG 

GlyValArgThrIleThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeu
3841 GGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTT
CCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAA

AlaAspGlyGlyCysSerGlyGlyAlaTyrAspIleIleIleCysAspGluCysHisSer
3901 GCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCC 
CGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGG 

(Val) 

ThrAspAlaThrSerIleLeuGlyIleGlyThrValLeuAspGlnAlaGluThrAlaGly
3961 ACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGG 
TGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTGTGAGGCCCC 

AlaArgLeuValValLeuAlaThrAlaThrProProGlySerValThrValProHisPro 
4021 GCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCC
CGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGG

AsnIleGluGluValAlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlaIle
4081 AACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATC 
TTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAG 

ProLeuGluValIleLysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCys
4141 CCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGC
GGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACG
FIG. 18-6

AspGluLeuAlaAlaLysLeuValAlaLeuGlyIleAsnAlaValAlaTyrTyrArgGly
4201 GACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGT
CTCCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCA 

LeuAspValSerValIleProThrSerGlyAspValValValValAlaThrAspAlaLeu
4261 CTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTC 
GAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAG 

Tyr 

MetThrGlyTyrThrGlyAspPheAspSerValIleAspCysAsnThrCysValThrGln
4321 ATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAG 
TACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTC 

(Ser) 

ThrValAspPheSerLeuAspProThrPheThrIleGluThrIleThrLeuProGlnAsp
4381 ACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGAT 
TGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTA 

AlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysProGlyIleTyrArg
4441 GCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGA 
CGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCT 

PheValAlaProGlyGluArgProSerGlyMetPheAspSerSerValLeuCysGluCys
4501 TTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGC
AAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACG 

TyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArg
4561 TATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGA
ATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCT 

AlaTyrMetAsnThrProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGly 
4621 GCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGC 
CGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCG 

ValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSerGly
4681 GTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGG 
CAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCC 

GluAsnLeuProTyrLeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaPro 
4741 GAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCT 
CTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGA

ProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGly 
4801 CCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGG
GGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCC 

ProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisPro
4861 CCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCA 
GGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGT 

ValThrLysTyrIleMetThrCysMetSerAlaAspLeuGluValValThrSerThrTrp
4921 GTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGG
CAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACC

ValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysVal
4981 GTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTG
CACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCAC 

ValIleValGlyArgValValLeuSerGlyLysProAlaIleIleProAspArgGluVal
5041 GTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTC 
CAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAG 
FIG. 18-7

LeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyrIleGluGln
5101 CTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAA
GAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTT

GlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSer
5161 GGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCC
CCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGG

ArgGlnAlaGluValIleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPhe
5221 CGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTC
GCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAG

TrpAlaLysHisMetTrpAsnPheIleSerGlyIleGlnTyrLeuAlaGlyLeuSerThr
5281 TGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACG
ACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGC

LeuProGlyAsnProAlaIleAlaSerLeuMetAlaPheThrAlaAlaValThrSerPro
5341 CTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCA
GACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGG 

LeuThrThrSerGlnThrLeuLeuPheAsnIleLeuGlyGlyTrpValAlaAlaGlnLeu
5401 CTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTC
GATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAG

AlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlaIleGly
5461 GCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGA
CGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCT

SerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyValAla
5521 AGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCG 
TCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGC 

(Gly) 

GlyAlaLeuValAlaPheLysIleMetSerGlyGluValProSerThrGluAspLeuVal
5581 GGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTC
CCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAG

AsnLeuLeuProAlaIleLeuSerProGlyAlaLeuValValGlyValValCysAlaAla
5641 AATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCA
TTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGT

IleLeuArgArgHisValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIle
5701 ATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATA
TATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTAT

AlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrValProGluSerAspAla 
5761 GCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCA
CGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGT

(HisCys) 

AlaAlaArgValThrAlaIleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeu
5821 GCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTG
CGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGAC

HisGlnTrpIleSerSerGluCysThrThrProCysSerGlySerTrpLeuArgAspIle
5881 CACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATC
GTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAG

TrpAspTrpIleCysGluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMet
5941 TGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATG
ACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTAC
FIG. 18-8

ProGlnLeuProGlyIleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArg
6001 CCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGA 
GGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCT 

(Val) 

GlyAspGlyIleMetHisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLys
6061 GTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAA 
CACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTT 

AsnGlyThrMetArgIleValGlyProArgThrCysArgAsnMetTrpSerGlyThrPhe
6121 AACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTC
TTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAG 

ProIleAsnAlaTyrThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPhe
6181 CCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTC 
GGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAG 

AlaLeuTrpArgValSerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHis 
6241 GCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCAC
CGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTG 

TyrValThrGlyMetThrThrAspAsnLeuLysCysProCysGlnValProSerProGlu
6301 TACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAA
ATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTT

PhePheThrGluLeuAspGlyValArgLeuHisArgPheAlaProProCysLysProLeu 
6361 TTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTG
AAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAAC 

LeuArgGluGluValSerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeu
6421 CTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATACCCGGTAGGGTCGCAATTA
GACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCATCCCAGCGTTAAT

ProCysGluProGluProAspValAlaValLeuThrSerMetLeuThrAspProSerHis
6481 CCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATGCTGACTGATCCCTCCCAT
GGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTACGAGTGACTAGGGAGGGTA 

IleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGlySerProProSerValAlaSer 
6541 ATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGC 
TATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCG

SerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAsp
6601 TCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATGAC 
AGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTACTG 

SerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsn 
6661 TCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGCAAC 
AGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCGTTG 

IleThrArgValGluSerGluAsnLysValValIleLeuAspSerPheAspProLeuVal
6721 ATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTC
TAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAG

AlaGluGluAspGluArgGluIleSerValProAlaGluIleLeuArgLysSerArgArg 
6781 GCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGGAGA
CGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCCTCT 

PheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProProLeuValGluThr
6841 TTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACG
AAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGC 
FIG. 18-9

TrpLysLysProAspTyrGluProProValValHisGlyCysProLeuProProProLys
6901 TGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCAAAG
ACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTC

SerProProValProProProArgLysLysArgThrValValLeuThrGluSerThrLeu 
6961 TCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTA 
AGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGAT 

(Ser) 

SerThrAlaLeuAlaGluLeuAlaThrArgSerPheGlySerSerSerThrSerGlyIle
7021 TCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATT 
AGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAA 

ThrGlyAspAsnThrThrThrSerSerGluProAlaProSerGlyCysProProAspSer 
7081 ACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCC 
TGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGG 

(PheAla) 

AspAlaGluSerTyrSerSerMetProProLeuGluGlyGluProGlyAspProAspLeu 
7141 GACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTT 
CTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAA 

SerAspGlySerTrpSerThrValSerSerGluAlaAsnAlaGluAspValValCysCys
7201 AGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGC
TCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACG

SerMetSerTyrSerTrpThrGlyAlaLeuValThrProCysAlaAlaGluGluGlnLys
7261 TCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAA 
AGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTT 

LeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeuValTyrSerThr
7321 CTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACC
GACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGG

ThrSerArgSerAlaCysGlnArgGlnLysLysValThrPheAspArgLeuGlnValLeu 

7381 ACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTG 
TGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGAC 

AspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSerLysValLysAla
7441 GACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCT 
CTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGA 

(Phe) 

AsnLeuLeuSerValGluGluAlaCysSerLeuThrProProHisSerAlaLysSerLys
7501 AACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAG 
TTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTC 

PheGlyTyrGlyAlaLysAspValArgCysHisAlaArgLysAlaValThrHisIleAsn
7561 TTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAAC
AAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTG

SerValTrpLysAspLeuLeuGluAspAsnValThrProIleAspThrThrIleMetAla
7621 TCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCT 
AGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGA 

LysAsnGluValPheCysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIle 
7681 AAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATC 
TTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAG 
FIG. 18-10

ValPheProAspLeuGlyValArgValCysGluLysMetAlaLeuTyrAspValValThr
7741 GTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACA 
CACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGT 

LysLeuProLeuAlaValMetGlySerSerTyrGlyPheGlnTyrSerProGlyGlnArg 
7801 AAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGG
TTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCC 

ValGluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAsp 
7861 GTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGAT
CAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTA

ThrArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlaIleTyr
7921 ACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTAC 
TGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATG 

GlnCysCysAspLeuAspProGlnAlaArgValAlaIleLysSerLeuThrGluArgLeu
7981 CAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTT 
GTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAA 

(Gly) 

TyrValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArg 
8041 TATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGC
ATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCG 

AlaSerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrIleLysAlaArg
8101 GCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGG 
CGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCC 

AlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeu 
8161 GCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTA 
CGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAAT 

ValValIleCysGluSerAlaGlyValGlnGluAspAlaAlaSerLeuArgAlaPheThr
8221 GTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACG 
CAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGC 

GluAlaMetThrArgTyrSerAlaProProGlyAspProProGlnProGluTyrAspLeu 
8281 GAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTG 
CTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAAC 

GluLeuIleThrSerCysSerSerAsnValSerValAlaHisAspGlyAlaGlyLysArg
8341 GAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCACGACGGCGCTGGAAAGAGG
CTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCC

ValTyrTyrLeuThrArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAla 
8401 GTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCA 
CAGATGATGGAGTCGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGT 

ArgHisThrProValAsnSerTrpLeuGlyAsnIleIleMetPheAlaProThrLeuTrp
8461 AGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGG 
TCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACC 


AlaArgMetIleLeuMetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGlu
8521 GCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAA 
CGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTT 

GlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerIleGluProLeuAspLeuPro
8581 CAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCT 
GTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGA 
ProIleIleGlnArgLeuHisGlyLeuSerAlaPheSerLeuHisSerTyrSerProGly
8641 CCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTCTCCAGGT 
GGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGAGGTGTCAATGAGAGGTCCA 

GluIleAsnArgValAlaAlaCysLeuArgLysLeuGlyValProProLeuArgAlaTrp 
8701 GAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGTACCGCCCTTGCGAGCTTGG 
CTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCATGGCGGGAACGCTCGAACC 

Gly 

ArgHisArgAlaArgSerValArgAlaArgLeuLeuAlaArgGlyGlyArgAlaAlaIle
8761 AGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATA
TCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTCTCCTCCGTCCCGACGGTAT 

CysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLeuLysLeuThrProIleAla 
8821 TGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCTCAAACTCACTCCAATAGCG 
ACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGAGTTTGAGTGAGGTTATCGC 

AlaAlaGlyGlnLeuAspLeuSerGlyTrpPheThrAlaGlyTyrSerGlyGlyAspIle 
8881 GCCGCTGGCCAGCTGGACTTGTCCGGCTGGTTCACGGCTGGCTACAGCGGGGGAGACATT 
CGGCGACCGGTCGACCTGAACAGGCCGACCAAGTGCCGACCGATGTCGCCCCCTCTGTAA 

(Pro) 

TyrHisSerValSerHisAlaArgProArgTrpIleTrpPheCysLeuLeuLeuLeuAla 
8941 TATCACAGCGTGTCTCATGCCCGGCCCCGCTGGATCTGGTTTTGCCTACTCCTGCTTGCT 
ATAGTGTCGCACAGAGTACGGGCCGGGGCGACCTAGACCAAAACGGATGAGGACGAACGA 

AlaGlyValGlyIleTyrLeuLeuProAsnArgOP
9001 GCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAACACTCCGGCCT 
CGTCCCCATCCGTAGATGGAGGAGGGGTTGGCTACTTCCAACCCCATTTGTGAGGCCGGA 


FIG. 18-11 
:42.16.XT1

GGTAGGGTCAAGGCTGAAATCGACTGTCTGCTTCTTTGGAGAAAGTGGTG 
:42.17.XT1 

ATCCTGGGGGAGCGTGATTGTCTCAATGGTCTTCTTTGGAGAAAGTGGTG 
:42.18.XT1 

AGTCCTGCCCCGACGTTGAGTGCGGGAGACCTTCTTTGGAGAAAGTGGTG 
:42.19.XT1 

CACAAATCTGTAGATGCCTGGCTTCCCCCTCTTCTTTGGAGAAAGTGGTG 
:42.20.XT1 

GTCGAACATGCCGGAGGGGCGCTCCCCCGGCTTCTTTGGAGAAAGTGGTG 
:42.21.LLA2C 

GCCTGCGTCATAGCACTCACAGAGGACGGATTAGGCATAGGACCCGTGTC 
:42.22.LLA2C 

AGTCTCGGCGGGCGTGAGCTCATACCAAGCTTAGGCATAGGACCCGTGTC 
:42.23.LLA2C 

CGGGGTGTTCATGTACGCTCGTAGCCTAACTTAGGCATAGGACCCGTGTC 
:42.24.LLA2C 

AAATTCAAGATGGTCCTGGCACACGGGAAGTTAGGCATAGGACCCGTGTC 
:42.25.LLA2C 

TATATGAGTGAGGCCTGTAAAGACGCCCTCTTAGGCATAGGACCCGTGTC 
:42.26.LLA2C 

ACTCTGCTTTGTCTGGGATAGAAAGTGGGCTTAGGCATAGGACCCGTGTC 
:42.27.LLA2C 

TTGGTACGCTACCAGGTAAGGAAGGTTCTCTTAGGCATAGGACCCGTGTC 
:42.28.LLA2C 

GGGAGGGGCTTGAGCCCTAGCGCACACGGTTTAGGCATAGGACCCGTGTC 
:42.29.LLA2C 

AATCAAACACTTCCACATCTGGTCCCACGATTAGGCATAGGACCCGTGTC 
:42.30.LLA2C 

GGGTGTTGGCCCATGGAGGGTGGGCTTGAGTTAGGCATAGGACCCGTGTC 


:42.31.LLA2C 

TTCATTCTGAACAGCGCCCAGTCTGTATAGTTAGGCATAGGACCCGTGTC 

FIG. 19-1 
:42.XT1.1

TCCTCACAGGGGAGTGATTCATGGTGGAGTCTTCTTTGGAGAAAGTGGTG 
:42.XT1.2 

ATGGCTAGACGCTTTCTGCGTGAAGACAGTCTTCTTTGGAGAAAGTGGTG 
:42.XT1.3 

TCCTGGAGGCTGCACGACACTCATACTAACCTTCTTTGGAGAAAGTGGTG 
:42.XT1.4 

CGCAGACCACTATGGCTCTCCCGGGAGGGGCTTCTTTGGAGAAAGTGGTG
:42.XT1.5

TCGTCCTGGCAATTCCGGTGTACTCACCGGCTTCTTTGGAGAAAGTGGTG 
:42.LLA2C.6 

GCATTGAGCGGGTTGATCCAAGAAAGGACCTTAGGCATAGGACCCGTGTC 
:42.LLA2C.7

AGCAGTCTTGCGGGGGCACGCCCAAATCTCTTAGGCATAGGACCCGTGTC 
:42.LLA2C.8 

ACAAGGCCTTTCGCGACCCAACACTACTCGTTAGGCATAGGACCCGTGTC 
:42.LLA2C.9

GGGGCACTCGCAAGCACCCTATCAGGCAGTTTAGGCATAGGACCCGTGTC 
:42.LLA2C.10

CGTGCTCATGGTGCACGGTCTACGAGACCTTTAGGCATAGGACCCGTGTC 
:42.LLA2C.11

GTTACGTTTGTTTTTTTTTTGAGGTTTAGGTTAGGCATAGGACCCGTGTC 
:42.LLA2C.12

CGGGAACTTGACGTCCTGTGGGCGACGGTTTTAGGCATAGGACCCGTGTC 
:42.LLA2C.13 

CAAGTAAACTCCACCAACGATCTGACCGCCTTAGGCATAGGACCCGTGTC 
:42.LLA2C.14 

GCGCACACCCAATCTAGGGCCCCTGCGCGGTTAGGCATAGGACCCGTGTC
:42.LLA2C.15 

AGGTTGCGACCGCTCGGAAGTCTTTCTCGTTTAGGCATAGGACCCGTGTC 

FIG. 19-2 
:42.32.XT1

ATGTTGGGATGGGGCACAGTGACGGAGCCCCTTCTTTGGAGAAAGTGGTG 
:42.33.XT1

ATCTCTCCGGTGGTGGACAGAGCAACCTCCCTTCTTTGGAGAAAGTGGTG 
:42.34.XT1

ACTTCGAGGGGGATAGCCTTGCCGTAAAAACTTCTTTGGAGAAAGTGGTG
:42.35.XT1

TCACAGAAGATGAGATGTCTCCCCCCCTTGCTTCTTTGGAGAAAGTGGTG 
:42.36.LLA2C 

TTTGCGGCGAGTTCGTCGCACTTCTTCTTTTTAGGCATAGGACCCGTGTC

:42.37.LLA2C 

TAGGCCACGGCATTGATGCCCAATGCGACCTTAGGCATAGGACCCGTGTC 
:42.38.LLA2C 

GTCGGGATGACGGACACGTCAAGACCGCGGTTAGGCATAGGACCCGTGTC 
:42.39.LLA2C

GCATCGGTTGCCACGACGACAACATCGCCGTTAGGCATAGGACCCGTGTC 
:42.40.LLA2C 

GAGTCGAAGTCGCCGGTATAGCCGGTCATGTTAGGCATAGGACCCGTGTC
:42.41.LLA2C

GTCTGGGTGACACACGTATTGCAGTCTATCTTAGGCATAGGACCCGTGTC 
:42.42.LLA2C 

ATGGTGAAGGTAGGGTCAAGGCTGAAATCGTTAGGCATAGGACCCGTGTC 
:42.43.LLA2C 

GAGACAGCATCCTGGGGGAGCGTGATTGTCTTAGGCATAGGACCCGTGTC

FIG. 19-3 
FIG. 22 Translation of DNA 81

SerGlyLysProAlaIleIleProAspArgGluValLeuTyrArgGluPheAspGluMet
1 GTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGAT
CAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTA 

GluGluCysSerGlnHisLeuProTyrIleGluGlnGlyMetMetLeuAlaGluGlnPhe
61 GGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTT
CCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAA 

LysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluValIleAlaPro
121 CAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCC
GTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGG 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe
181 TGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTT
ACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAA 

IleSerGlyIleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlaIleAla
241 CATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGC
GTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACG

SerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
301 TTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 


FIG. 23 Translation of DNA 36 

AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAla
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTCGAAGGAATGGACCATC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
61 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTCGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeu
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrIleMetThrCys
181 GGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla
241 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 


AlaLeuAlaAlaTyrCysLeuSerThrGlyCysValValIleValGlyArgValValLeu
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC

Overlap with 81 

SerGlyLysProAlaIleIleProAspArgGluValLeuTyrArg
361 TCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTC 
FIG. 24 Translation of DNA 37b


LeuAlaAlaLysLeuValAlaLeuGlyIleAsnAlaValAlaTyrTyrArgGlyLeuAsp
1 CTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGAC 
GAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTG 


ValSerValIleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThr
61 GTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACC
CACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGG 


GlyTyrThrGlyAspPheAspSerValIleAspTyrAsnThrCysValThrGlnThrVal
121 GGCTATACCGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTC 
CCGATATGGCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAG 

Overlap with 

AspPheSerLeuAspProThrPheThrIleGluThrIleThrLeuProGlnAspAlaVal
181 GATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTC
CTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAG 

clone 35 

SerArgThrGlnArgArgGlyArgThr 
241 TCCCGCACTCAACGTCGGGGCAGGACTG
AGGGCGTGAGTTGCAGCCCCGTCCTGAC 
FIG. 25A
FIG. 26A
Translation of DNA CA84a

GlnGlyCysAsnCysSerIleTyrProGlyHisIleThrGlyHisArgMetAlaTrpAsp
1 CGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGGG
GCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACCC 


MetMetMetAsnTrpSerProThrThrAlaLeuValMetAlaGlnLeuLeuArgIlePro
61 ATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATCC
TATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAGG 


GlnAlaIleLeuAspMetIleAlaGlyAlaHisTrpGlyValLeuAlaGlyIleAlaTyr
121 CACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCGT 
GTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGCA 

Overlap with CA59a 

PheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGlyVal 
181 ATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGCG
TAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCGC 


AspAlaGluThrHisValThrGly 
241 TCGACGCGGAAACCCACGTCACCGGGG 
AGCTGCGCCTTTGGGTGCAGTGGCCCC 


FIG. 27
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Translation of DNA 40b

AlaTyrMetSerLysAlaHisGlyIleAspProAsnIleArgThrGlyValArgThrIle
1 GGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT
CCGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspIleIleIleCysAspGluCysHisSerThrAspAlaThrSer
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlyIleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnIleGluGluVal
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlaIleProLeuGluValIle
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 


LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

Overlap with 37b 

LysLeuValAlaLeuGlyIleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 


IleProThr
481 CATCCCGACCAG
GTAGGGCTGGTC

FIG. 28
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FIG. 29 
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HCV cDNA OF CLONE 40a 


GluPheGlyAlaIleProLeuGluValIleLysGlyGlyArgHisLeuIlePheCysHis
1 GAATTCGGGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCAT
CTTAAGCCCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTA 


61 TCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTG
AGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCAC 


121 GCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGTGATGTTGTCGTCGTG 
CGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCACTACAACAGCAGCAC 


181 GCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAAT 
CGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTA 


241 ACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACTATTGAGACAATC 
TGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGATAACTCTGTTAG 


301 ACGCTCCCCCAAGATGCTCCGAATTC
TGCGAGGGGGTTCTACGAGGCTTAAG

FIG. 32
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FIG. 34 Translation of DNA 35

SerIleGluThrIleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlyIleTyrArgPheValAlaProGlyGluArgProSerGly
61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTC 
TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
181 ACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTG
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 


CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla
241 TGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCC 
ACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 


HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

Overlap with 36

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 


LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 
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NANB 


19   9       NANB               N/A
20   10      NANB               N/A
21   11      NANB               N/A
22   12      Normal             N/A
23   13      Normal             N/A
24   14      Normal             N/A
26   30174   Normal             N/A
27   30105   Normal             N/A
28   30072   Normal             N/A
29   30026   Normal             N/A
30   30146   Normal             N/A
31   30250   Normal             N/A
32   30071   Normal             N/A
33   15      Acute HAV          N/A
34   16      Acute HAV          N/A
35   17      Acute HAV          N/A
36   18      Acute HAV          N/A
37   48088   Acute HAV          N/A
38   47288   Acute HAV          N/A
39   47050   Acute HAV          N/A
40   46997   Acute HAV          N/A
41   19      Convalescent HBV   N/A
42   20      (anti-HBSag+ve;    N/A
43   21      anti-HBCag+ve)     N/A
44   22      (anti-HBSag+ve;    N/A
45   23      anti-HBCag+ve)     N/A
46   24      (anti-HBSag+ve;    N/A
47   25      anti-HBCag+ve)     N/A
48   26      (anti-HBSag+ve;    N/A
49   27      anti-HBSag+ve)     N/A

*Sequential serum samples were assayed from these patients
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FIG. 41 
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FIG. 43 
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Human 23

FIG. 46


GlyPheAlaAspLeuMetGlyTyrIleProLeuValGlyAlaProLeuGlyGlyArgAla
1 GGCTTCGCCGACCTCATGGGGTACATTCCGCTCGTCGGCGCCCCTCTTGGAGGCCGTGCC

ArgAlaLeuAlaHisGlyValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsn
61 AGGGCCCTGGCGCACGGCGTCCGGGTTTTGGAAGACGGCGTGAACTATGCAACAGGGAAC
CG A 

LeuProGlyCysSerPheSerIlePheLeuLeuAlaLeuLeuSerCysLeuThrValPro
121 CTTCCTGGTTGCTCCTTTTCTATCTTCCTTCTGGCCCTACTCTCTTGCCTGACCGTGCCC

GA T 

AlaSerAlaTyrGlnValArgAsnSerThrGlyLeuTyrHisValThrAsnAspCysPro
181 GCTTCAGCCTACCAAGTGCGGAACTCTACGGGGCTTTACCATGTCACCAATGATTGCCCT

AsnSerSerIleValTyrGluAlaAlaAspAlaIleLeuHisAlaProGlyCysValPro
241 AACTCGAGTATTGTGTACGAGGCGGCCGATGCCATCCTGCACGCTCCGGGGTGTGTCCCT

T C 

CysValArgGluAspAsnValSerArgCysTrpValAlaValThrProThrValAlaThr
301 TGCGTTCGCGAGGATAACGTCTCGAGATGTTGGGTGGCGGTGACCCCCACGGTGGCGACC 
G T 

LysAspGlyLysLeuProThrThrGlnLeuArgArgHisIleAspLeuLeuValGlySer
361 AAGGACGGCAAACTCCCCACAACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGC

C A 

AlaThrLeuCysSerAlaLeuTyrValGlyAspLeuCysGlySerIlePheLeuValGly
421 GCCACCCTCTGCTCGGCCCTCTACGTGGGGGACCTTTGCGGGTCCATCTTTCTTGTCGGT
T 

GlnLeuPheThrPheSerProArgArgHisTrpThrThrGlnAspCysAsnCysSerIle
481 CAACTGTTTACCTTCTCTCCCAGGCGCCACTGGACGACGCAGGACTGCAACTGCTCTATC

C 

TyrProGlyHisIleThrGlyHisArgMetAlaTrpAspMetMetMetAsnTrpSerPro
541 TATCCCGGCCATATAACGGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCT

G 

ThrAlaAlaLeuValValAlaGlnLeuLeuArgIleProGlnAlaIleLeuAspMetIle
601 ACGGCGGCATTGGTAGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATC
G AG 

AlaGlyAlaHisTrpGlyValLeuAlaGlyMetAlaTyrPheSerMetValGlyAsnTrp
661 GCTGGTGCTCACTGGGGAGTCCTGGCGGGCATGGCGTATTTCTCCATGGTGGGGAACTGG

G 

AlaLysValLeuValValLeuLeuLeuPheAlaGlyValAspAlaGluThrHisArgThr
721 GCGAAGGTCCTGGTAGTGCTGCTTCTATTTGCCGGCGTCGACGCGGAAACCCACCGTACC

G 

GlyGlySerAlaAlaArgSerThrAlaGlyValAlaSerLeuPheThrProGlyAlaArg
781 GGGGGAWTGCCGCCCGCAGCACGGCTGGAGTTGCTAGTCTCTTCACAC 
C T A 

GlnAsnIleGlnLeuIleAsnThrAsnGlySerTrpHisIleAsnSerThrAlaLeuAsn
841 CAGAACATCCAGCTGATCAACACCAACGGCAGTTGGCACATCAATAGTACGGCCTTGAAC

At 

CysAsnAspSerLeuThrThrGlyTrpLeuAlaGlyLeuPheTyrHisHisLysPheAsn
901 TGCAATGACAGCCTTACCACCGGCTGGTTAGCGGGGCTTTTCTATCACCATAAATTCAAC
A A 

SerSerGlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAlaGln
961 TCTTCAGGCTGTCCCGAGAGGTTGGCCAGCTGCCGACCCCTCACCGATRTGCCCA^ 
G A G 
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Human 27 

FIG. 47 

GlyPheAlaAspLeuMetGlyTyrIleProLeuValGlyAlaProLeuGlyGlyAlaAla
1 GGCTTCGCCGACCTCATGGGGTACATTCCGCTCGTCGGCGCTCCTCTTGGGGGCGCTGCC 

ArgAlaLeuAlaHisGlyValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsn
61 AGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAAC

LeuProGlyCysSerPheSerIlePheLeuLeuAlaLeuLeuSerCysLeuThrValPro
121 CTTCCTGGTTGCTCTTTCTCTATCTTCCTTCTGGCTCTGCTCTCCT 

AlaSerAlaTyrGlnValArgAsnSerSerGlytt^

181 gcatcggcctaccaagtacgcaactcctc^ 

AsnSerSerIleValTyrGluThrAlaAspThrIleLeuHisSerProGlyCysValPro
241 aattcgagtattgtgtacgagacggccgacaccatc^ 
c c 

CysValArgGluGlyAsnAlaSerLysCysTrpValProValAlaProThrValAlaThr
301 TGCGTTCGCGAGGGTAACGCCTCGAAATGTTGGGTGCCGGTAGCCCCCACAGTGGCCACC
G 

ArgAspGlyAsnLeuProAlaThrGlnLeuArgArgHisIleAspLeuLeuValGlySer
361 AGGGACGGCAACCTCCCCGCAACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGT 

G G 

AlaThrLeuCysSerAlaLeuTyrValGlyAspLeuCysGlySerValPheLeuValGly 
421 GCCACCCTTTGCTCGGCCCTCTATGTGGGGGACriGTGCGGGTCTGTCTTTCTTGTCGG? 
C C 

GlnLeuPheThrPheSerProArgArgHisTrpThrThrGlnAspCysAsnCysSerIle
481 CAACTGTTCACTTTCTCCCCCAGGCGCCACTGGACAACGCAAGATTGCAACTGCTCTATC

A 

TyrProGlyHisIleThrGlyHisArgMetAlaTrpAspMetMetMetAsnTrpSerPro
541 TACCCCGGCCATATAACGGGACACCGCATGGCATGGGATATGATGATGAACTGGTCCCCT

ThrAlaAlaLeuValMetAlaGlnLeuLeuArgIleProGlnAlaIleLeuAspMetIle
601 ACAGCAGCGCTGGTAATGGCTCAGCTGCTCAGGATCCCGCAAGCCATCTTGGACATGATC
G 

AlaGlyAlaHisTrpGlyValLeuAlaGlyIleAlaTyrPheSerMetValGlyAsnTrp
661 GCTGGTGCTCACTGGGGAGTCCTAGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGG

AlaLysValLeuValValLeuLeuLeuPheAlaGlyValAspAlaThrThrTyrThrThr
721 GCGAAGGTCCTGGTGGTGCTGTTGCTGTTTGCCGGCGTCGATGCGACAACCTATACCACC 


GlyGlyAsnAlaAlaArgThrThrGlnAlaLeuThrSerPhePheSerProGlyAlaLys
781


GlnAspIleGlnLeuIleAsnThrAsnGlySerTrpHisIleAsnArgThrAlaLeuAsn
841 CAGGATATCCAGCTGATCAACACCAACGGCAGTTGGCACATCAATCGCACGGCCTTGAAC
G T 

CysAsnAlaSerLeuAspThrGlyTrpValAlaGlyLeuPheTyrTyrHisLysPheAsn
901 TGTAATGCGAGCCTCGACACTGGCTGGGTAGCGGGGCTCTTCTATTACCACAAATTCAAC
T G 

SerSerGlyCysProGluArgMetAlaSerCysArgProLeuAlaAspPheAspGln
961 TCTTCAGGCTGCCCCGAGAGGATGGCCAGCTGTAGGCCCCTTGCCGACT 

C 
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10 20 30 40 

EC10 GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 

X:::::::::::::::::::::::::::::::::::::::s:: 
HCV1 CTCTCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTCGTCCCCTACGACGGCGTTAGT

*"* •*** , ***!*"**»!I*»*J*!!t!t!»5tJI*!»!J2*2t!5 2 2 

WICG^^CCX^T^^^ 

. 610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTC 

AATOGCT^GCTCCTCCGGATCC^CAAGCCA 

670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 

GGGAGTCCTC<XGGG<^TAGCCTACT 

730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

2222222*222*222222222*222222222**2222*22i***2 2*222 * 2 2 2 * 

AGTCCTCCTCCTA!^^ 

790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

• • } 2 • s 2 2 S 2 2 ! • 2 2 2 2 2 2 2* t'2-222221222 2 2 2 2 2 2 * 

CCACACTCTGTCTG^TTTCTlb^ 

850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 

22 22*i*2222222?22****2*t22 2 2 2 2 it 2 2 2 2 * 2 2S2222222*'2*2222* a 2 

GATCAAi^CCAACGG^G'n^CACC'K^TAGCAC 

910 920 930 940 950 960

410 420

CAACACCGGCTGGAATTC 
:::: t :::::: :X 

CAACACCXK^TGGTTGGCAGGGCTTTTCTATCACCACAAGrTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG. 50-1 
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AA #117-308 (putative envelope region)

1) HCT #18 (USA) 3 clones sequenced

2) JH23 (USA) ? 

3) JH 27 (USA) ? 

4) PBL-Th (USA) 2 clones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

1) (» 

2) 

3) 

4) 
5) 

6) RNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNL

1) H 
2) 

3) S T T 

4) I 

5) (F) s 

6) PGCSFSIFLLALLSCLTVPASAYQVRNSTGLYHVTNDCPNSSIVYEAADAILH

1) (H) v V T 

2) A D V V k T 

3) S PVA N 

4) A ART 

5) H V T 
6) TPGCVPCVREGNASRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATLCS

D 

2) I D 

3) 0 
4) 

5) I 

6) ALYVGDLCGSVFLVGQLFTFSPRRHWTTQGCNCSI
SUMMARY: "S" AA 117-308 (93%)

HCT #18, PBL-Th, EC1 (Italy) have 97% homology with HCV-1

JH23 and JH27 have 96% and 95% homology with HCV-1, respectively
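
Homology figures of this kind can be recomputed from an alignment by counting identical positions. The sketch below is illustrative only: the figure does not state how the percentages were calculated, so it assumes a simple, gap-free, position-by-position identity over two equal-length aligned sequences. The function name `percent_identity` and the example variant are hypothetical.

```python
# Minimal sketch: percent identity between two aligned, equal-length sequences.
# Assumption: no gaps, every aligned position weighted equally.
def percent_identity(reference: str, other: str) -> float:
    """Return the percentage of aligned positions at which the residues match."""
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(reference, other))
    return 100.0 * matches / len(reference)


if __name__ == "__main__":
    hcv1_fragment = "TTQGCNCSIYPGH"  # short HCV-1 stretch, as in the alignment rows
    variant = "TTQDCNCSIYPGH"        # hypothetical isolate with one substitution
    print(f"{percent_identity(hcv1_fragment, variant):.1f}% identity")
```

Applied over the full AA 117-308 alignment instead of this short fragment, the same count would be expected to give values comparable to those summarized above, provided the alignment contains no gaps.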
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AA #300-438 (C-terminal region of the putative envelope region and amino-1/3 of NS1)


1) JH23 ?

2) JH27 ?

3) Japanese isolate (T. Miyamura) ?

4) EC10 (Italy) 2 clones sequenced (one nt difference, which did not result in an amino acid change)

5) HCV-1 (chimpanzee) multiple

S <|> NS1
A V

1) D 

2) D 
3) 
4) 

5) TTQGCNCSIYPGHITGHRMAWDMMMNW^


vs 


VM V 


1) M R ARSTA VA 

2) T YT N AR TQALT F 

3) L V I M GH R VQ VT TLT 

4) A I AK TASLTA 

5) HWGVLAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFVSL


I 


A 


1 )FS R I 

2) FT 01 I R 

3) FR S Kl V I R 

4) FNL I I R N 

5) LAPGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWL


T 
D 

Q 


SUMMARY: NS1 AA 330-660

Isolate        % Homology (AA 330-438)   % Homology (AA 383-405)
JH23           83                        57
JH27           80                        39
Japanese       73                        48
EC10 (Italy)   84                        48


FIG. 52 
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FIG. 53 

Name     Common Sequence    Variable Sequence

5'-3-1   AAGCTTGATCGAATTC   CGATCTTGC
-2                          CGATCCTGC
-3                          CGATCATGC
-4                          CGATCGTGC
-5                          CGAAGTTGC
-6                          CGAAGCTGC
-7                          AGATCTTGC
-8                          AGATCCTGC
-9                          AGATCATGC
-10                         AGATCGTGC
-11                         AGAAGTTGC
-12                         AGAAGCTGC
-13                         CGATCTTGT
-14                         CGATCCTGT
-15                         CGATCATGT
-16                         CGATCGTGT
-17                         CGAAGTTGT
-18                         CGAAGCTGT
-19                         AGATCTTGT
-20                         AGATCCTGT
-21                         AGATCATGT
-22                         AGATCGTGT
-23                         AGAAGTTGT
-24                         AGAAGCTGT
-25                         CGCTCTTGC
-26                         CGCTCCTGC
-27                         CGCTCATGC
-28                         CGCTCGTGC
-29                         CGCAGTTGC
-30                         CGCAGCTGC
-31                         CGCTCTTGT
-32                         CGCTCCTGT
-33                         CGCTCATGT
-34                         CGCTCGTGT
-35                         CGCAGTTGT
-36                         CGCAGCTGT
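
The 36 variable sequences listed above appear to be every combination of three leading triplets (CGA, AGA, CGC), six middle triplets (TCT, TCC, TCA, TCG, AGT, AGC) and two trailing triplets (TGC, TGT). The sketch below regenerates the set on that assumption. The grouping into triplets is an observation about the table, not something the figure states, and appending each variable sequence to the common sequence is likewise assumed; the names `FIRST`, `MIDDLE`, `LAST` and `variable_sequences` are illustrative.

```python
# Minimal sketch: enumerate the variable 9-mers of the table as a 3 x 6 x 2
# combination of triplets. The triplet grouping is inferred from the listed
# sequences and is an assumption, not a statement from the figure.
from itertools import product

COMMON = "AAGCTTGATCGAATTC"                         # common sequence from the table
FIRST = ["CGA", "AGA", "CGC"]                       # leading triplets
MIDDLE = ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"]  # middle triplets
LAST = ["TGC", "TGT"]                               # trailing triplets


def variable_sequences() -> list[str]:
    """Return all 36 variable 9-mers (ordering differs from the table's numbering)."""
    return [a + b + c for a, b, c in product(FIRST, MIDDLE, LAST)]


def full_primers() -> list[str]:
    """Common sequence followed by each variable sequence (assumed arrangement)."""
    return [COMMON + v for v in variable_sequences()]


if __name__ == "__main__":
    for primer in full_primers():
        print(primer)
```

The enumeration order here differs from the numbering in the table, so the comparison with the listed entries should be made on the set of sequences rather than on their positions.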
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